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TRANSGENIC MICE CONTAINING TARGETED GENE DISRUFOONS 

Field of the Invenlioii 

The present invention relates to transgenic animals, compositions and meftods relating to the 
characterization of gene function. 
5 Background of the Invention 

Experimental animal models are inoportant tools for understanding the role of genes. More 
particularly, (he ability to develop animals with specific genes altered or inactivated has been 
invaluable to the study of gene ftmction, and has lead to imexpected discoveries of a gene and/or 
mechanisms responsible for disease with similar manifestations in humans. These genetically 
10 engineered animals are also useful for identifying and testing therapeutic treatments for a variety of 
diseases and disorders. 

The identification of the function of numerous genes has been useful in ascertaining the roles 
of these genes in disease. Because of the high level of homology between humans and mice, for 
exanq)le, it is possible to define the function of individual human genes by making targeted germline 

15 mutations m selected genes in the animal. The phenotype of the resulting mutant animal can be used 
to help define the phenotype in humans. 

Several interesting genes have recenfly been discovered belonging to various families 
encoding G-protein coupled receptors (GPCRs), LDL receptors, cerberus and cerberus-like proteins, 
and proteins involved m carbohydrate metabolism, particularly those involved in epithelial develop- 

20 ment. Identifying the roles of these genes and their expression products may permit the definition of 
disease pathways and the identification of diagnostically and therapeutically useful targets. 

GPCRs have been characterized as having seven putative transmembrane domains (desig- 
nated TMl, TM2, TM3, TM4, TM5, TM6, and TM7) which are beUeved to represent transmembrane 
a-helices connected by extracellular or cytoplasmic loops. Most G-protein coupled receptors have 

25 single conserved cysteme residues in each of the first two extracellular loops which form disulfide 
bonds that are believed to stabilize functional protem structure. G-protein coupled receptors can be 
intracellularly coupled by het^otrimeric G-proteins to various intracellular enzymes, ion channels and 
transporters. Different G-protein a-subunits preferentially stimulate particular effectors to modulate 
various biological functions in a cell. 

30 Platelet-activating factor (PAF) has been implicated as a mediator in diverse pathologic 

processes, such as allergy, asthma, septic shock, arterial flirombosis, and inflarmnatory processes 
(Prescott etal, 7. Biol Chem. 265:17381-17384 (1990)). PAF is a phospholipid (l-O-alkyl-2-acetyl- 
sn-glycero-3-phosphorylcholine) and exerts its various effects via specific cell surface receptors fliat 
bind PAF witii high affinity. Using a guinea pig probe, Seyfiried et al (Genomics 13: 832-834 (1992)) 

35 isolated the gene for human PAF receptor (PTAFR). The coding sequence contains no mtrons. The 
encoded protein is highly homologous to flie guinea pig PAF receptor (82% identity) and contains 7 
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putative transiiiembraae domains. The PAF receptor (PAFR) therefore appears to be a member of the 
G protem-coupled family of receptors and exhibits significant similarity to many members of this 
family. By analysis of rodent/human somatic cell hybrids. Seyfried et al concluded that the PTAFR 
gene is located on hmnan chromosome L 
5 Recently, murine ESTs were identified (accession AA170490; GI: 1749042; accession 

AA162789; GI: 1738456) bearing sequence similarity to human platelet activating receptor homolog 
(accession NM 013308; GI;7019400). Additionally, a murine EST was identified (accession 
AA274I12; GI: 1912557) bearing sequence similarity to a human GPCR (KIAAOOOl; Nomura et al,, 
DNA Res 2(4):210 (1995)) and to human platelet-activating factor receptor. The start of the EST 

10 corresponds to approxhnately position 85 of PAF receptor which is in the 1st extracellular loop. The 
strongest homology is to KIAAOOOl, which is a randondy identified GPCR sequence with similarity 
to anaphylatoxin C5a receptor and PAF receptor. Over the past 15 years, nearly 350 therapeutic 
agents targeting 7 transmembrane receptors have been successfiilly introduced onto the market As 
these receptors have an established, proven history as therapeutic targets, a clear need exists for 

15 identification and characterization of GPCRs which can play a role in preventmg, ameliorating or 
correcting dysfimctions or diseases. 

Epidermal growth factor (EGF) has a characteristic structure, which coirprises three disulfide 
bridges that are essential for the protein's growth-stimulatuig activity. Many other proteins, however, 
have similar EGF domains (EGF-lihe domains). These proteins include both growth factors and other 

20 proteins having functions unrelated to EGF. (See, e,g,. Van Zoelen et al,. Vitamins and Hormones 
59:99-131 (2000)). 

Calcium bmding EGF-like domains are present in many extracellular proteins that perform a 
diverse range of functions. These proteins include, for example, fibrillin-1, Notch-3, protem S, fector 
IX, and tire low-density lipoprotem (LDL) receptor. (See, e.g., Smalhidge et al. Journal of 

25 Molecular Biology 286(3):661-668 (1999)). 

The LDL receptor family is known to bind and mtemalize apoE-rich lipoprotems and thought 
to play a role in lipoprotein metabolism. {See, e,g., Bujo, Nippon rinsho 57(12):2690-2695 (1999); 
Gliemann, Biological Chemistry 379:951-964 (1998)). Mutations cause familial 
hypercholesterolemia and premature coronary artery disease. LDL receptor-related proteins play an 

30 important role m the clearance of plasma-activated alpha 2-acroglobuIin and apolipoprotem E- 

enriched lipoproteios. It is essential for fetal development and has been associated with Alzheimer's 
disease (Hussain, et al., Ann, Rev. Nutr. 19: 141-72 (1999)). Recent studies have shown that this 
receptor femily is ircportant in neural development, suggesting that the LDL receptor family 
coxrprises multifimctional receptors involved in intracellular signal transduction, neuron migration, 

35 vascular smooth muscle cell proliferation, and lipoprotein mcorpomtion. {See, e.g., Bujo (1999), 
supra). 
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Recently, an expressed sequence tag (EST) (GenBank Accession No. AA482431) having 
EGF-Uke domains and showing similarities to LDL receptor-related proteins was identified. The gene, 
termed low-density lipoproteui receptor related protem 5 (LRP5), has been implicated as a candidate 
for conferring susceptibility to diabetes based on its chromosomal localization within the insuUn- 
dependent diabetes mellitus (IDDM) locus {see Hey, etal, Gene 216:103-111 (1998)). 

Cerberus and cerbems-like proteins con:5)rise a protein family expressed in the embryo, that 
are able to induce, enhance, and/or inhibit the formation, growth, and maintenance of neurons and 
related neural cells. The cerberus gene encodes a secreted protein that is expressed in the anterior 
endomesodenn, and when microinjected into Xenopus embryos, induces ectopic heads, and dupli- 
cated hearts and livers. {See, Bouwmeester et aU Nature 382(6592):595-601 (1996)). The 
cerberus-like (cerrl) gene was subsequently isolated in mice, and encodes a novel secreted protein 
specifically expressed in the anterior visceral endoderm during early gastrulation. (See, e.g., Belo et 
id,. Mechanisms of Development 68:45-57 (1997)). In Xenopus assays, cer-1 acts as a potent 
neutralizmg factor that induces forebrain markers and endoderm, but is unable to induce ectopic head- 
like structures as cerberus. Cer-1 encodes a putative secreted protein that is 48% identical to cerberus 
over a 110-amino acid region. {See, e.g., Shawlot et aU Proceedings of the National Academy of 
Sciences, USA 95(ll):6198-6203 (1998)). These proteins appear to have a 9 cysteme residue pattern. 
{See, e,g., U.S. Patent No. 5,935,852). 

The cerberus protem has also shown to be a multifunctional antagonist of Nodal, BMP, and 
Wnt signals. Specifically, the cerberus protein functions as a multivalent growtii factor antagonist in 
the extracellular space, bmding to Nodal, BMP, and Wnt proteins through independent sites. {See, 
€,g.. Piccolo et al. Nature 382 (6592):595-601 (1996)). Reported studies indicated that cerberus does 
not have a receptor or a dedicated transduction pathway, but acts as an extracellular inhibitor. {See, 
e.g,, Agius et al.. Journal de la Societe de biologie 193:347-354 (1999)). 

The human cerberus protem has been isolated and reported {See, e.g. , PCT Patent 
Publication No.: WO 9849296). As described in the art, these protems are capable of inducing 
endodermal, cardiac, and neural tissue development in vertebrates when expressed, and may be useful 
in applications that requke regeneration, differentiation, or repau: of tissues, such as wound repak, 
neuronal regeneration or transplantation, supplementation of heart muscle differentiation, and 
differentiation of tiie pancreas. (5ee,^.g., PCT Patent Publication No.: WO 9748275). These 
proteins may also be useful m the treatment of BMP-related disorders. {See, e.g., WO 9849296). 
Recenfly, an ejq)ressed sequence tag (EST) (GenBank Accession No. AA120122) bearing high 
similarities to cerberus and cerberus-like genes was identified. 

Successful development of fertile eggs involves the direct interaction between germline cells 
and follicle cells. {See, e.g., McLaren and Wyhe, Current Problems in Germ Cell Differentiation, 
Cambridge University Press (1992)). Epitiielial morphogenesis and tiie development of tiie follicular 
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epithelium and gennline differentiation that occurs in oogenesis have been studied well in Drosophila 
melanogaster, (See, e.g., King, Ovarian Development In Drosophila Melanogaster, New York, 
Academic Press, (1970)). More particularly, the monolayer follicular epithelium has developmental, 
morphological, and molecular properties of vertebrate epithelia, and develops in concert with the 
5 differentiating germline during oogenesis in Drosophila Melanogaster. (See, e.g., Goode et al.. 
Development 122:3863-3879 (1996)). 

Several classes of cell adhesion molecules are essential for epithelial development in both 
invertebrates and vertebrates. (See, e.g., Guinbiner etal, Cell 69:385-387 (1992)). For exanq)le, e- 
cadhedrin cell adhesion molecules are required for the formation and maintenance of epithelial 

10 structure m vertebrates and Drosophila. (See, e.g., Takeichie, Development 102:639-655 (1995); 
Tepass etal. Genes Dev. 10:672-685 (1996); and Uemural a/.. Genes Dev. 10:659-671 (1996)). 
Receptor tyrosine kmases and their ligands are cell surface factors that are also necessary for 
epithelial development. (See, e.g„ Nalduii et d., EMBO J. 10:2867-2878 (199 1); Miettinen et al. 
Nature 376:337-341 (1995); Luetteke et al, Cell 73:249-261 (1993); Schubach, Cell 49:699-707 

15 (1987); Goode et al. Dev. Biol 178:35-50 (1996)). 

Neurogenic genes encode another class of molecules important in the development of the 
epithelium Drosophila neurogenic genes, mcluding. Notch (n), Delta, neuralized, and Enhancer split 
are necessary for development of epithelial characteristics in embryonic tissue. (See, e.g., Artavanis- 
Tsakonas et al.. Science 268:225-232 (1995); Hartenstem et al. Development 1 16: 1203-1220 (1992); 

20 Coffinau etal.. Cell 73:659-671 (1993); Ruohola et al. Cell 66:1-20 (1991); and Xu et al. 

Development 1 15:913-922 (1992)). Although these Drosophila neurogenic genes contribute to 
epithelial characteristics in embryonic tissue, litde is known about thek role in follicular epithelium 
morphogenesis. 

One neurogenic gene, in particular, known as brainiac has shown to play a specific role in 
25 epithelial development. The brainiac gene has been reported to encode a novel, putative secreted 
protem requined in the germlme for establishmg the follicular epiflielium and for determining its 
dorsal-ventral polarity. (See, e.g,, Goode et al, (1992) and Goode et al, Dev. Biol (1996) supra). 
More specifically, the brainiac gene is present on the X chromosome and encodes a 325 amino acid 
protein with a putative signal sequence. The brainiac gene is expressed constitutively^in the germline 
30 during the first 12 hours of embryogenesis. (See, e.g., Goode, S. et al (1992) supra; Goode et al. 

Dev. Biol (1996) supra; and Goode et al. Development (1996) supra) Mutations m the brainiac gene 
demonstrate synergistic genetic interactions with mutations in signaling molecules, including, 
transforming growdi factor alpha (TGF-a) and epidermal growth factor receptor (Egfr). TGF-a and 
Egfr mutant mice suffer firom multi-organ failure due to widespread in^airment of epithelial 
35 development (See, e.g. , Miettinen et al, (1995) supra). 
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Brainiac genes: are m5)ortant for correct development of the follicular epithelium. Brainiac 
mutant females and thek offspring have multiple defects including ventralization of the eggshell, gaps 
in the follicular epithelium, and multiple layers of follicle cells around oocytes. In the absence of 
brainiac gene function, less efficiency in foUicular epithelium formation occurs, resulting in a failure 
to individuate germline cysts. Once the follicular epithelium is formed, brainiac genes are essential 
for maintenance of its epithelial characteristics. In addition, disruption in brainiac germline function 
results in loss of apicalbasal polarity and an accumulation of multiple layers of follicle cells, 
particularly around the oocyte. Brainiac is essential for efficient migration and maintenance of border 
cells and main body epithelial cells moving towards the oocyte during the late phases of oogenesis. 
{See^ e.g.^ Goode et aL, Development {1996) supra). An additional gene known as egghead has 
shown to function similarly in epithelial development. Moreover, both genes appear to be compo- 
nents of novel signaling pathways essential for epithelial development and maintenance. Most 
recently, an e;q)ressed sequence tag (EST) was isolated bearing sequence similarity to genes encoding 
neurogenic secreted signaling proteins, and more particularly to the bramiac gene (GenBank 
Accession No.: AA133340; EST name: zn92h01.sl). 

Summary of the Invention 

The present mvention generally relates to transgenic ammals, as well as to conopositions and 
methods relating to the characterization of gene function. 

The present invention provides transgenic cells comprising a disruption in a targeted gene. 
The transgenic cells of the present invention are comprised of any cells capable of undergoing 
homologous recombination. Preferably, the cells of the present invention are stem cells and more 
prefembly, embryonic stem (ES) cells, and most preferably, murine ES cells. According to one 
embodiment, the transgenic cells are produced by introducing a targetmg construct into a stem cell to 
produce a homologous recombinant, resultmg in a mutation of the targeted gene. In another 
embodiment, the transgenic ceils are derived from the transgenic animals described below. The cells 
derived fi-om the transgenic animals includes cells that are isolated or present in a tissue or organ, and 
any cell lines or any progeny thereof 

The present invention also provides a targeting construct and methods of producing the 
targethig construct that when introduced into stem cells produces a homologous recombmant In one 
embodiment, the targeting construct of the present mvention comprises first and second polynucleo- 
tide sequences that are homologous to the targeted gene. The targetmg construct also comprises a 
polynucleotide sequence that encodes a selectable marker that is preferably positioned between the 
two different homologous polynucleotide sequences in the construct. The targeting construct may 
also conoprise other regulatory elements that may enhance homologous reconibination. 

The present invention further provides non-human transgenic animals and methods of 
producing such non-human transgenic animals conq)rising a disruption in a target gene. The 
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taBsgenic animals of the present invention include transgenic animals that aie heterozygous and 
homozygous for a mutation in the target gene. In one aspect, the transgenic animals of the present 
mvention are defective in the ftinction of die target gene. In another aspect, the transgenic animals of 
the present invention comprise a phenotype associated wifli having a mutation in a target gene. 

The present mvention also provides methods of identifymg agents capable of affecting a 
phenotype associated with a disruption of a target gene in a transgenic animaL For example, a 
putative agent is administered to the transgenic anunal and a response of the transgenic animal to the 
putative agent is measured and compared to the response of a "normal" or wild type mouse, or 
alternatively compared to a transgenic animal control (without agent administration). The mvention 
further provides agents identified according to such methods. The present invention also provides 
methods of identifymg agents useful as flierapeutic agents for treating conditions associated with a 
disruption of the target gene. 

The present invention further provides a method of identifymg agents havmg an effect on 
target gene expression or function. The mefliod includes admmistering an effective amount of the 
agent to a transgenic anunal, preferably a mouse. The mefliod includes measuring a lesponse of the 
transgenic animal, for example, to the agent, and comparing tiie response of the transgenic anunal to a 
control animal, which may be, for example, a wild-type animal or alternatively, a transgenic animal 
control. Con5)ounds that may have an effect on target gene expression or function may also be 
screened against cells in cell-based assays, for example, to identify such compounds. 

The invention also provides cell Imes con^rismg nucleic acid sequences of a target gene. 
Such cell hues may be capable of expressmg such sequences by vhtue of operable linkage to a 
promoter functional m flie cell line. Preferably, expression of the target gene sequence is under flie 
control of an inducible promoter. Also provided are mefliods of identifying agents tiiat interact with 
die target gene, con^)rising the steps of contacting the target gene or target protem witii an agent and 
detecting an agent/ target gene or agent/target protein complex. Such con^ilexes can be detected by, 
for exan^le, measuring expression of an operably linked detectable marker. 

The invention fiuther provides mediods of treating diseases or conditions associated widi a 
disruption in a target gene, and more particularly, to a disruption in flie expression or function of flie 
target gene or the target protein encoded by flie target gene. In a preferred embodiment, methods of 
flie present invention involve treating diseases or conditions associated with a disruption m flie target 
gene's expression or function, includmg admmistering to a subject in need, a therapeutic agent which 
effects target gene or target protem expression or function. In accordance wifli fliis embodiment, flie 
mefliod con5)rises administration of a tiierapeutically effective amount of a natural, synflietic, semi- 
synflietic, or recombinant target gene, target gene products or fragments thereof as well as natural, 
synflietic, semi-synthetic or recombinant analogs. 
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The present inventioB further provides methods of treating diseases or conditions associated 
with disrupted targeted gene expression or function, wherein the methods comprise detecting and 
replacing through gene therapy mutated target genes. 

Definitions 

5 The term "gene" refers to (a) a gene containing at least one of the DNA sequences disclosed 

herein; (b) any DNA sequence that encodes the amino acid sequence encoded by the DNA sequences 
disclosed herem and/or; (c) any DNA sequence that hybridizes to the complement of the coding 
sequences disclosed herein. Preferably, the term includes coding as well as noncoding regions, and 
preferably includes all sequences necessary for normal gene expression including promoters, 

10 enhancers and other regulatory sequences. 

The terms "polynucleotide" and "nucleic acid molecule" are used interchangeably to refer to 
polymeric forms of nucleotides of any length. The polynucleotides may contain deoxyribonucleo- 
tides, ribonucleotides and/or their analogs. Nucleotides may have any three-dimensional structure, 
and may perform any function, known or unknown. The term "polynucleotide" includes single-, 

15 double-stranded and triple helical molecules. 

"OUgonucleotide" refers to polynucleotides of between 5 and about 100 nucleotides of single- or 
double-stranded DNA. Oligonucleotides are also known as oligomers or oligos and may be isolated 
from genes, or chemically synthesized by methods known in the art. A "primer" refers to an 
oligonucleotide, usually single-stranded, that provides a 3'-hydroxyl end for the initiation of enzyme- 

20 mediated nucleic acid synthesis. The following are non-Umiting embodiments of polynucleotides: a 
gene or gene fragment, exons, introns, mRNA, tRNA rRNA, ribozymes, cDNA, recombinant 
polynucleotides, branched polynucleotides, plasmids, vectors, isolated DNA of any sequence, isolated 
RNA of any sequence, nucleic acid probes and primers. A nucleic acid molecule may also comprise 
modified nucleic acid molecules, such as methylated nucleic acid molecules and nucleic acid 

25 molecule analogs. Analogs of purines and pyrimidines are known in the art, and include, but are not 
limited to, aziridinycytosine, 4-acetylcytosine, 5-fluorouracil, 5-bromouracil, 5-carboxymethylamino- 
methyl-2-thiouracil, 5-carboxymethyl-aminomethyluracil, inosine, N6-isopentenyladenine, 1-methyl- 
adenine, 1-methylpseudouracil, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyl- 
adenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, pseudouracil, 5-pentyhiyluracil and 

30 2,6-diaminopurine. The use of uracil as a substitute for thymine in a deoxyribonucleic acid is also 
considered an analogous form of pyrimidine. 

A "fragment" of a polynucleotide is a polynucleotide comprised of at least 9 contiguous 
nucleotides, preferably at least 15 contiguous nucleotides and more preferably at least 45 nucleotides, 
of coding or non-coding sequences. 
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The term "gene targeting" refers to a type of homologous recombination that occurs when a 
fragment of genomic DNA is introduced into a mammalian cell and that fragment locates and 
recombines with endogenous homologous sequences. 

The term "homologous recombination" refers to the exchange of DNA fragments between 
5 two DNA molecules or chromatids at the site of homologous nucleotide sequmces. 

The term "homologous" as used herein denotes a characteristic of a DNA sequence having at least 
about 70 percent sequence identity as con[q>ared to a reference sequence, typically at least about 85 
percent sequence identity, preferably at least about 95 percent sequence identity, and more preferably 
about 98 percent sequence identity, and most preferably about 100 percent sequence identity as 

10 conq)ared to a reference sequence. Homology can be determined using a "BLASTN" algorithuL It is 
understood that homologous sequences can accommodate insertions, deletions and substitutions in the 
nucleotide sequence. Thus, linear sequences of nucleotides can be essentially identical even if some 
of the nucleotide residues do not precisely correspond or align. The reference sequence may be a 
subset of a larger sequence, such as a portion of a gene or flanking sequence, or a repetitive portion of 

15 a chromosome. 

The term "target gene" (alternatively refared to as "target gene sequence" or 'target DNA 

sequence" or 'target sequence") refers to any nucleic acid molecule or polynucleotide of any gene to 

be modified by homologous recombination. The target sequence includes an intact gene, an exon or 

mtron, a regulatory sequence or any region between genes. The target gene comprises a portion of a 
20 particular gene or genetic locus in the mdividual's genomic DNA. As provided herein, the target gene 

of the present invention is a platelet activating receptor gene, a PAF receptor gene, an LRP5 gene, a 

Cerberus gene, and a brainiac gene. 

A "platelet activating receptor gene" refers to a sequence con^rismg SEQ ID NO: 1 or 

conq)rising the sequence identified m Genebank as Accession No. AA162789; GL 1738456. In one 
25 aspect, the coding sequence of the platelet activating receptor gene comprises SEQ ID NO: 1 or 

comprises the gene identified in Genebank as Accession No.: AA162789; GI: 1738456. 

An "PAF receptor gene" refers to a sequence comprismg SEQ ID N0:4 or conq)rising die 

sequence identified in Genebank as Accession No. AA2741 12; GI: 1912557. hi one aspect, tiie coding 

sequence of the PAF receptor gene comprises SEQ ID N0:4 or con^rises the gene identified in 
30 Genebank as Accession No. AA2741 12; GI: 1912557. 

An *TJPR5 gene" refers to a sequence conq>rising refers to a sequence con:5)rising SEQ ID 

N0:7 or comprismg the sequence identified in Genebank as Accession No, NM_008513; GI:6678715. 

In one aspect, the coding sequence of the LPR5 gene con^rises SEQ ID NO:7 or comprises the gene 

identified in Genebank as Accession No. NM_0085 13; GI:6678715 . 
35 A "cerberus gene" refers to a sequence comprising SEQ ID NO: 10 or comprising the 

sequence identified m Genebank as Accession No. NM_009887; GI:6753409 . In one aspect, die 
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coding sequence of the cerberus gene comprises SEQ ID N0:7 or comprises the gene identified in 
Genebank as Accession No. NIVL009887; GI:6753409. 

A "brainiac gene" refers to a sequence comprismg SEQ ID NO: 13 or comprising the sequence 
identified in GenBank as Accession No.: AA133340; EST name: zn92h01.sL In one aspect, the 
5 coding sequence of the platelet activating receptor gene con5)rises SEQ ID NO: 13 or comprises the 
gene identified m Genebank as Accession No,: AAI33340; EST name: zaa92hOLsl. 

'Disruption" of a target gene occurs when a fi:agment of genomic DNA locates and 
recombines with an endogenous homologous sequence. These sequence disruptions or modifications 
may include insertions, missense, firameshift, deletion, or substitutions, or replacements of DNA 

10 sequence, or any combination thereof. Insertions include the insertion of entire genes which may be 
of animal, plant, prokaryotic, or viral origin. Disruption, for exaii5>le, can alter or replace a promoter, 
enhancer, or splice site of a target gene, and can alter the normal gene product by mhibiting its 
production partially or completely or by enhancing the normal gene product's activity. 

The term, "transgenic cell", refers to a cell contaming within its genome a target gene that has 

15 been disrupted, modified, altered, or replaced completely or partially by the naethod of gene targeting. 

The term "transgenic animal" refers to an animal that contains within its genome a specific 
gene that has been disrupted by the method of gene targeting. The transgenic animal includes both the 
hetero2ygote animal (Le., one defective allele and one wild-type allele) and the homozygous animal 
0'.e„ two defective alleles), 

20 As used herein, the terms "selectable marker" or "positive selection marker" refers to a gene 

encoding a product that enables only the cells that carry the gene to survive and/or grow under certaui 
conditions. For example, plant and animal cells that express the introduced neomycin resistance 
(NeoO gene are resistant to the compound G418. Cells that do not carry the Neo"^ gene marker are 
killed by G418. Other positive selection markers will be known to those of skill in the art, 

25 A "host cell" includes an individual cell or cell culture which can be or has been a recipient 

for vector(s) or for incorporation of nucleic acid molecules and/or proteins. Host cells include 
progeny of a single host cell, and the progeny may not necessarily be completely identical (in 
morphology or in total DNA complement) to the original parent due to natural, accidental, or 
deliberate mutation. A host cell includes cells transfected with the constructs of the present invention. 

30 The term "modulates" as used herein refers to the inhibition, reduction, increase or 

enhancement of a target gene or target protein function, expression, activity, or alternatively a 
phenotype associated with a disruption in a target gene. 

The term "ameliorates" refers to a decreasing, reducuig or eUmmating of a condition, disease, 
disorder, or phenotype, includmg an abnormality or syn^ptom associated with a disruption in a target 

35 gene. 
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The term "abnormality" refers to any disease, disorder, condition, or phenotype in which a 
disruption of a target gene is implicated, including pathological conditions. 

Brief Description of the Drawuigs 
Figure I shows the polynucleotide sequence for a platelet activating receptor gene (SEQ ID 

5 NO:l). 

Figure 2A-2B shows design of the targeting construct used to disrupt platelet activating 
receptor genes. Figure 2B shows the sequences identified as SEQ ID N0:2 and SEQ ID N0:3, which 
were used as the targeting arms (homologous sequences) in the targeting construct. 

Figure 3 shows a graph relating to the hot plate testing on mice having a disruption in a 
10 platelet activating receptor genes. 

Figure 4 shows a graph relating to the open field testmg on mice having a disruption in a 
platetet activating receptor genes. 

Figure 5 shows ttie polynucleotide sequence for a FAF receptor gene (SEQ ID N0:4). 

Figure 6A-6B shows design of the targeting construct used to disrupt FAF genes. Figure 6B 
15 shows the sequences identified as SEQ ID N0:5 and SEQ ID NO:6, which were used as the targeting 
arms (homologous sequences) in the targeting construct. 

Figure 7A-7C shows the polynucleotide sequence for a LPR5 gene (SEQ K) N0:7). 

Figure 8 shows tihe amino acid sequence for a LPR5 polypeptide (SEQ ID NO: 8). 

Figure 9A-9C shows design of the targeting construct used to disrupt LPR5 genes. Figure 9C 
20 shows the sequences identified as SEQ ID N0;9 and SEQ ID NO: 10, which were used as the 
targeting arms (homologous sequences) in the targeting construct. 

Figure 10 shows a graph relating to the open field testing (time spent in the central region) of 
mice having a dismption in a LPR5 gene. 

Figure 1 1 shows a graph relatmg to the open field testing (total distance traveled) of mice 
25 havmg a disruption in a LPR5 gene. 

Rgure 12 shows the polynucleotide sequence for the cerberus gene (SEQ ID NO: 11) and the 
amino acid sequence of a cerberus polypeptide (SEQ ID NO: 12). 

Figure 13A-13B shows design of the targeting construct used to disrupt cerberus genes. 
Figure 13B shows the sequences identified as SEQ ID NO: 13 and SEQ E) NO: 14, which were used 
30 as the targetmg arms (homologous sequences) in the targeting construct. 

Figure 14 shows a graph relating to the open field testing of mice having a dismption in a 
cerberus gene. 

Figure 15 shows a graph relating to the tail suspension testing of mice having a disruption in a 
cerberus gene. 
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Figure 16 shows the polynucleotide sequence of a brainiac gene (SEQ ID NO: 17) and SEQ 
ID NO: 16 and SEQ ID NO: 17 which were used as the targeting arms (homologous sequences) in the 
brauiiac gene targeting construct. 

Detailed Description of the Inveation 
5 The invention is based, in part, on the evaluation of the expression and role of genes and gene 

expression products, primarily those associated with a target gene. Among others, the invention 
permits the definition of disease pathways and the identification of diagnostically and therapeutically 
useful targets. For exarcple, genes which are mutated or down-regulated imder disease conditions 
may be involved in causing or exacerbating the disease condition. Treatments directed at up- 
10 regulatmg the activity of such genes or treatments which involve alternate pathways, may ameliorate 
the disease condition. 

Generation of Tarpetine Construct 

The targeting construct of the present invention may be produced using standard methods 
known m the art. {See, e,g., Sambrook, et al, 1989, Molecular Cloning: A Laboratory Manual, 

15 Second Edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York; E.N. Glover 
(eds.), 1985, DNA Cloning: A Practical Approach, Yohm^s I and 11; M.J, Gait (ed.), 1984, 
Oligonucleotide Synthesis; B.D. Hames & S.J. Higgins (eds.), 1985, Nucleic Acid Hybridization; B.D. 
Hames & SJ. Higgins (eds.), 1984, Transcription and Translation; R.L Freshney (ed.), 1986, Animal 
Cell Culture; Immobilized Cells and Enzymes, IRL Ptess, 1986; B. Perbal, 1984, A Practical Guide 

20 To Molecular Cloning; F.M. Ausubel et a/., 1994, Current Protocols in Molecular Biology, John 
Wiley & Sons, Inc.). For example, the targetmg constmct may be prepared in accordance with 
conventiorud ways, where sequences may be synthesized, isolated from natural sources, manipulated, 
cloned, ligated, subjected to in vitro mutagenesis, primer repair, or the like. At various stages, the 
joined sequences may be cloned, and analyzed by restriction analysis, sequencing, or the like. 

25 The targeting DNA can be constructed using techniques well known in the art. For example, 

the targeting DNA may be produced by chemical synthesis of oligonucleotides, ruck-translation of a 
double-stranded DNA ten^plate, polymerase chain-reaction anjplification of a sequence (or ligase 
chain reaction amplification), purification of prokaryotic or target cloning vectors harboring a 
sequence of interest (e.g. , a cloned cDNA or genomic DNA, synthetic DNA or from any of the 

30 aforementioned combination) such as plasmids, phagemids, YACs, cosmids, bacteriophage DNA, 

other viral DNA or replication intermediates, or purified restriction fragments thereof, as well as other 
sources of smgle and double-stranded polynucleotides having a desired nucleotide sequence. 
Moreover, the length of homology may be selected using known methods in the art. For example, 
selection may be based on the sequence composition and complexity of the predetermined 

35 endogenous target DNA sequence(s). 
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The targeting construct of the present invention typically coiiq)rises a first sequence 
homologous to a portion or region of the target gene and a second sequence homologous to a second 
portion or region of the target gene. The targeting construct further comprises a positive selection 
marker, which is preferably positioned m between the first and the second DNA sequence that are 
5 homologous to a portion or region of the target DNA sequence. The positive selection marker may be 
operatively linked to a promoter and a polyadenylation signal. 

Other regulatory sequences known in the art may be mcoiporated into the targeting construct 
to disrupt or control expression of a particular gene m a specific cell type. In addition, the targeting 
construct may also include a sequence coding for a screening marker, for exanq)le, green fluorescent 

10 protem (GFP), or another modified fluorescent protein. 

Although the size of the homologous sequence is not critical and can range fix)m as few as 50 
base pairs to as many as 100 kb, preferably each fragment is greater than about 1 kb in length, more 
preferably between about 1 and about 10 kb, and even more preferably between about 1 and about 5 
kb. One of sldll in the art will recognize that although larger fragments may mcrease the nunaber of 

15 homologous recombination events in ES cells, larger fragments will also be more difficult to clone. 
In a preferred embodiment of the present invention, the targeting construct is prepared 
direcfly from a plasmid genomic library using the methods described m pendmg U.S. Patent Applica- 
tion No/. 08/971,310, filed November 17, 1997, the disclosure of which is incorporated herein in its 
entnety . Generally, a sequence of interest is identified and isolated from a plasmid library in a smgle 

20 step using, for Qxsanple, long-range PGR. Following isolation of this sequence, a second polynucleo- 
tide that will disrupt the target sequence can be readily inserted between two regions encoding the 
sequence of interest In accordance with this aspect, the construct is generated in two steps by 
(1) amplifying (for example, using long-range PGR) sequences homologous to the target sequence, 
and (2) mserting anoflier polynucleotide (for example a selectable marker) into the PGR product so 

25 that it is flanked by the homologous sequences. Typically, the vector is a plasmid from a plasmid 
genomic library. The con^leted construct is also typically a circular plasmid. 

la another embodiment, the targetmg construct is designed in accordance with the regulated 
positive selection method described m U.S. Application No. 60/232,957, filed September 15, 2000, 
the disclosure of which is incorporated herein in its enthrety. The targeting construct is designed to 

30 include a PGK-neo fusion gene having two lacO sites, positioned in the PGK promoter and an NLS- 
lad gene comprising a lac repressor fused to sequences encoding the NLS from the SV40 T antigen. 

la another embodunent, the targeting construct may contain more than one selectable maker 
gene, mcludmg a negative selectable marker, such as die herpes simplex virus tk (HSV-fk) gene. The 
negative selectable marker may be operatively linked to a promoter and a polyadenylation signal. 

35 (See, e.g., U.S. Patent No. 5,464,764; U.S. Patent No. 5,487,992; U.S. Patent No. 5,627,059; and U.S. 
Patent No. 5,631,153). 
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5 

Generation of Cells and Caafinnatioa of Homologous Recombination Events 
Once an appropriate targeting construct has been prepared, the targeting construct may be 
introduced into an appropriate host cell using any method known in the art. Various techniques may 
be en5)loyed in the present invention, including, for example, pronuclear microinjection; retrovirus 
5 mediated gene transfer into germ lines; gene targeting in embryonic stem cells; electroporation of 
embryos; sperm-mediated gene transfer; and calcium phosphate/DNA co-precipitates, microinjection 
of DNA into flie nucleus, bacterial protoplast fusion with intact cells, transfection, polycations, e.g., 
polybrene, polyomithine, etc, or die like (See, e.g., U.S. Pat No. 4,873,191; Van der Putten, etal, 
1985, Proc. Natl Acad. ScL, USA 82:6148-6152; Thompson, etal, 1989, Cell 56:313-321; Lo, 1983, 

10 MolCelL Biol 3:1803-1814; Lavitrano, etaUm9Xelh 57:717-723). Various techniques for 
transforming mammalian cells are known in the art, (See, e.g„ Gordon, 1989, Ml Rev. Cytol, 
115:171-229; Keown et al, 1989, Methods in Enzymology; Keown et al., 1990, Methods and 
Enzymology, Vol. 185, pp. 527-537; Mansour et aU 1988, Nature, 336:348-352). 

hi a prefeaed aspect of the present invention, the targeting construct is mtroduced into host 

15 cells by electroporation. In this process, electrical impulses of high field strength reversibly 

peraieabilize biomembranes allowing the introduction of the construct. The pores created during 
electroporation permit tiie uptake of macromolecules such as DNA. (See, e.g.. Potter, H., et al, 1984, 
Proc. Natl Acad. Set U.S.A. 81:7161-7165). 

Any cell type capable of homologous recombmation may be used in the practice of the 

20 present invention. Examples of such target cells include cells derived ftom vertebrates including 
mammals such as humans, bovine species, ovine species, miniae species, simian species, and ether 
eucaryotic organisms such as filamentous fimgi, and higher multicellular organisms such as plants. 

Preferred cell types include embryonic stem (ES) cells, which are typically obtained from pre- 
m5)lantation embryos cultured in vitro, (See, e.g., Evans, M. L, et al, 1981, Nature 292:154-156; 

25 Bradley, M. O., et al, 1984, Nature 309:255-258; Gossler et al, 1986, Proc. Natl Acad. Set USA 
83:9065-9069; and Robertson, et al, 1986, Nature 322:445-448). The ES cells are cultured and 
prepared for introduction of the targeting constract using methods well known to the skilled artisan. 
(See, e.g., Robertson, E. J. ed. "Teratocarcmomas and Embryonic Stem Cells, a Practical Approach". 
IRL Press, Washmgton D.C., 1987; Bradley et d., 1986, Current Topics in Devel Biol 20:357-371; 

30 by Hogan et al m "Manipulating the Mouse Embryo": A Laboratory Manual, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor N.Y., 1986; Thomas et al, 1987, CeZ/ 51:503; KoUer et al, 
1991, Proc. Natl Acad, Sci USA, 88:10730; Dorm etal, 1992, Trmisgenic Res. 1:101; and Veis et 
al, 1993, Cell 75:229). The ES cells tiiat will be inserted with the targeting constract are derived 
from an embryo or blastocyst of the same species as the developmg embryo into which they are to be 

35 introduced. ES cells are typically selected for their ability to integrate into flie inner cell mass and 
contribute to the germ line of an individual when introduced into the mammal in an embryo at the 
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blastocyst stage of development. Thus, any ES cell line having this capability is suitable for use in the 
practice of the present invention. 

The present invention may also be used to knockout genes in other cell types, such as stem 
cells. By way of exan5)le, stem cells may be myeloid, lymphoid, or neural progenitor and precursor 
5 cells. These cells comprising a disnq)tion or knockout of a gene may be particularly useful in the 
study of target gene function m individual developmental pathways. Stem cells may be derived from 
any vertebrate species, such as mouse, rat, dog, cat, pig, rabbit, human, non-human primates and the 
like. 

After the targetmg construct has been introduced into cells, the cells where successful gene 

10 targeting has occurred are identified. Insertion of the targetmg construct mto the targeted gene is 

typically detected by identifying cells for expression of the marker gene. In a preferred embodiment, 
the cells transformed with the targeting construct of the present invention are subjected to treatment 
with an appropriate agent that selects against cells not expressing the selectable marker. Only those 
cells expressing the selectable marker gene survive and/or grow under certain conditions. For 

15 example, cells that express the introduced neomycin resistance gene are resistant to the compound 
G418, while cells that do not express the neo gene marker are killed by G418. If the targeting 
construct also corqprises a screening marker such as GFP, homologous recombination can be 
identified through screerung cell colonies under a fluorescent light. Cells that have undergone 
homologous recorhbination will have deleted the GFP gene and will not fluoresce. 

20 If a regulated positive selection method is used in identifymg homologous recombination 

events, the targeting construct is designed so that the expression of the selectable marker gene is 
regulated in a maimer such that expression is inhibited following random integration but is permitted 
(derepressed) following homologous recombination. More particularly, the transfected cells are 
screened for expression of the neo gene, which requires that (1) the cell was successfully 

25 electroporated, and (2) lac repressor inhibition of neo transcription was relieved by homologous 
recombmation. This method allows for the identification of transfected cells and homologous 
recombinants to occur in one step with the addition of a single drug. 

Alternatively, a positive-negative selection technique may be used to select homologous 
reconibinants. This technique involves a process in which a first drug is added to the cell population, 

30 for example, a neomycin-like drug to select for growfli of transfected cells, ie. positive selection. A 
second drug, such as FIAU is subsequentiy added to kill cells that express the negative selection 
marker, Le. negative selection. Cells that contain and ej^ress the negative selection marker are killed 
by a selecting agent, whereas cells that do not contain and e3q>ress the negative selection marker 
survive. For example, ceUs with non-homologous insertion of the construct ej^ross HS V thymidine 

35 kmase and therefore are sensitive to the herpes drugs such as gancyclovk (GANG) or FIAU (l-(2" 
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deoxy 2-fluoro-B-D-arabinofluranosyl)-5-iodouracil). (See, e,g., Mansour et al,. Nature 336:348-352: 
(1988); Capecchi, Science 244:1288-1292, (1989); CapeccW, Trends in Genet 5:70-76 (1989)). 

Successful recombination may be identified by analyzing the DNA of the selected cells to 
confirm homologous recombination. Various techniques known in the art, such as PGR and/or 
Southern analysis may be used to confirm homologous recombination events. 

Homologous recombination may also be used to disrupt genes in stem cells, and other cell 
types, which are not totipotent embryonic stem cells. By way of example, stem cells may be myeloid, 
lynqjhoid, or neural progenitor and precursor cells. Such transgenic cells may be particularly useful 
in the study of gene function in individual developmental pathways. Stem cells may be derived from 
any vertebrate species, such as mouse, rat, dog, cat, pig, rabbit, human, non-human primates and the 
like. 

In cells which are not totipotent it may be desirable to knock out both copies of the target 
using methods which are known in the art. For exan5)le, cells comprising homologous recombination 
at a target locus which have been selected for expression of a positive selection marker (e.g., Neo") 
and screened for non-random integration, can be further selected for multiple copies of the selectable 
marker gene by exposure to elevated levels of the selective agent (e.g., (3418). The cells are then 
analyzed for homozygosity at the target locus. Altematively, a second construct can be generated 
with a different positive selection marker inserted between the two homologous sequences. The two 
constructs can be mtroduced into the cell either sequentially or simultaneously, followed by 
appropriate selection for each of the positive marker genes. The final cell is screened for homologous 
recombination of both alleles of the target. 

Production of Transgenic Animals 

Selected cells are then injected into a blastocyst (or other stage of development suitable for 
the purposes of creating a viable animal, such as, for example, a morula) of an aiumal (e.g. , a mouse) 
to form chimeras (see e.g., Bradley, A. in Teratocarcinomas and Embryonic Stem Cells: A Practical 
Approach, E. J. Robertson, ed., IRL, Oxford, pp. 113-152 (1987)). Altematively, selected ES cells 
can be allowed to aggregate with dissociated mouse embryo cells to form the aggregation chimera, A 
chimeric embryo can then be in:q)Ianted into a suitable pseudopregnant female foster animal and the 
embryo brougjit to term Chimeric progeny harboring the homologously recombined DNA in their 
germ cells can be used to breed animals in which all cells of the animal contain the homologously 
recombined DNA. In one embodiment, chimeric progeny mice are used to generate a mouse wifli a 
heterozygous disruption in the gene. Heterozygous transgenic mice can then be mated. It is well 
know in the art that typically Va. of the offspring of such matings will have a homozygous disruption in 
the target gene. 

The heterozygous and homozygous transgenic mice can then be compared to normal, wild 
type mice to determine whether disruption of tiie target gene causes phenotypic changes, especially 
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pathological changes. For exeanplc, heterozygous and homozygous mice may be evaluated for 
phenotypic changes by physical examination, necropsy, histology, clinical chemistry, conq)lete blood 
count, body weight, organ weights, and cytological evaluation of bone marrow. 

In one embodiment, the phenotype (or phenotypic change) associated with a disruption in the 
target gene is placed into or stored in a database. Preferably, the database includes: (i) genbtypic data 
(e.g., identification of die disrupted gene) and (ii) phenotypic data (e.g., phenotype(s) resultmg from 
the gene disruption) associated with the genotypic data. The database is preferably electronic. In 
addition, the database is preferably combined with a search tool so that the database is searchable. 

Conditional Transgenic Animals 

The present mvention further contenq>lates conditional transgenic or knockout animals, such 
as those produced using recombination methods. Bacteriophage PI Cre recombinase and flp 
recombinase fi-om yeast plasmids are two non-limiting exanoples of site-specific DNA recombinase 
enzymes which cleave DNA at specific target sites flox P sites for cre recombinase and fit sites for flp 
recombmase) and catalyze a ligation of this DNA to a second cleaved site. A large number of suitable 
alternative site-specific recombinases have been described, and their genes can be used in accordance 
with the method of the present invention. Such recombinases include the Iht recombinase of bacterio- 
phage X (with or without Xis) (Weisberg, R. et al., hi Lambda II (Hendrix, R., et at, Eds.), Cbld 
Spring Harbor Press, Cold Spring Iferbor, NY, pp. 211-50 (1983), herem incorporated by inference); 
Tpnl and the p-lactamase transposons (Mercier, et d., J. Bacterial, 172:3745-57 (1990)); the Tn3 
resolvase (Flanagan & Femiewald /. Molea Biol, 206:295-304 (1989); Stark, etal,, Cell, 58:779-90 
(1989)); the yeast recombinases (Matsuzaki, et al„ J, Bacterial, 172:610-18 (1990)); the 5. subtilis 
SpoIVC recombmase (Sato, et al, J, Bacterial 172:1092-98 (1990)); the Hp recombinase (Schwartz 
& Sadowski, /. MalecBiol, 205:647-658 (1989); Parsons, etal, J. Biol Chertu, 265:4527-33 (1990); 
GoUc & lindquist. Cell, 59:499-509 (1989); Amux et al, J. Molec, Biol, 214:55-72 (1990;); the Bin 
recombmase (Glasgow, etal, /. Biol Chetn., 264:10072-82 (1989)); immunoglobulin recombinases 
(Malynn, et al. Cell 54:453-460 (1988)); and the Cm recombmase (EMifter & Bickle, EMBO 
7:3991-3996 (1988); Hubner, et al., I Molea Biol, 205:493-500 (1989)), aU herem incorporated by 
reference. Such systems are discussed by Echols (/. Biol Chem. 265:14697-14700 (1990)); de 
Villartay {Nature, 335:170-74 (1988)); Qaig, {Ann. Rev. Genet, 22:77-105 (1988)); Poyart-Sahneron, 
etal., {EMBO J. 8:2425-33 (1989)); Hunger-Bertling, etal {Mol Cell Biochem., 92:107-16 (1990)); 
and Gregg & Madden {Mol Gen. Genet., 219:320-23 (1989)), all herein incorporated by reference. 

Cre has been purified to homogeneity, and its reaction with the loxP site has been extensively 
characterized (AbremsM & Hess J, Mol Biol 259:1509-14 (1984), herein incorporated by reference). 
Ore protein has a molecular weight of 35,000 and can be obtained commercially fix)m New England 
Nuclear/DuPont. The cre gene (which encodes the Cre protein) has been cloned and expressed 
(Abremski, et al. Cell 32:1301-11 (1983), herem incorporated by reference). The Cre protein 
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mediates recombination between two loxP sequences (Sternberg, et al Cold Spring Harbor Symp. 
Quant Biol 45:297-309 (198 1)), which may be present on the same or different DNA molecule. 
Because llie internal spacer sequence of the loxP site is asymmetrical, two loxP sites can exhibit 
directionality relative to one another (Hoess & AbremsM Proc. Natl Acad. Set U.SA. 81:1026-29 
(1984 j). Thus, when two sites on the same DNA molecule are in a dnrectly repeated orientation, Cre 
will excise the DNA between the sites (Abremski, et al Cell 32:1301-11 (1983)). However, if the 
sites are inverted with respect to each other, the DNA between them is not excised after recombina- 
tion but is simply inverted. Thus, a cncular DNA molecule having two loxP sites in direct orientation 
will recombme to produce two smaller circles, whereas circular molecules having two loxP sites in an 
inverted orientation surply invert the DNA sequences flanked by the loxP sites. In addition, 
leconabinase action can result in reciprocal exchange of regions distal to the target site when targets 
are present on separate DNA molecules. 

Recombinases have imqportant application for characterizing gene function in knockout 
models. When the constructs described herem are used to disnqjt target genes, a fusion transcript can 
be produced when faisertion of the positive selection marker occurs downstream (3*) of the translation 
initiation site of the target gene. The fiasion transcript could result m some level of protein expression 
with unknown consequence. It has been suggested that insertion of a positive selection marker gene 
can affect the expression of nearby genes. These eiffects may make it difBcuIt to determine gene 
function after a knockout event since one could not discern whether a given phenotype is associated 
with the inactivation of a gene, or the transcription of nearby genes. Both potential problems are 
solved by exploiting recombinase activity. When the positive selection marker is flanked by 
recombinase sites in the same orientation, the addition of the corresponding recombmase will result m 
the removal of the positive selection marker. In this way, effects caused by the positive selection 
Doarker or e?q)ression of fusion transcripts are avoided. 

In one embodiment, purified recombinase enzyme is provided to the cell by direct 
micromjection. Jn another embodunent, recombinase is expressed from a co-transfected construct or 
vector in which the recombinase gene is operably linked to a functional promoter. An additional 
aspect of this embodiment is the use of tissue-specific or inducible recombinase constructs which 
allow the choice of when and where recombination occurs. One method for practicing the inducible 
forms of recombinase-mediated recombmation involves the use of vectors that use inducible or tissue- 
specific promoters or other gene regulatory elements to express the desired recombinase activity. The 
inducible expression elements are preferably operatively positioned to allow the inducible control or 
activation of ejqpression of the desited recombinase activity. Exanq)les of such inducible promoters or 
other gene regulatory elements include, but are not limited to, tetracycline, metallothionine, ecdysone, 
and other steroid-responsive promoters, rapamycin responsive promoters, and the like (No; et al 
Proc, Natl Acad ScL USA, 93:3346-51 (1996); Furth, etal Proc, Natl Acad ScL USA, 91:9302-6 
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(1994)). Additional control elements that can be used include promoters requiring specific 
transcription factors such as viral, promoters. Vectors incorporating such promoters would only 
express recombinase activity in cells that express the necessary transcription factors. 
Models for Disease 

The cell- and animal-based systems described herein can be utilized as models for diseases. 
Animal s of any species, including, but not limited to, mice, rats, rabbits, guinea pigs, pigs, micro-pigs, 
goats, and non-human primates, e.^., baboons, monkeys, and chimpanzees may be used to generate 
disease animal models. In addition, cells from humans may be used. These systems may be used in a 
variety of applications. Such assays may be utilized as part of screening strategies designed to 
identify agents, such as confounds which are capable of ameliorating disease syn^toms. Thus, the 
ammal- and cell-based models may be used to identify drugs, pharmaceuticals, flierapies and 
interventions which may be effective in treating disease. 

Cell-based systems may be used to identify corr5)ounds which may act to ameliorate disease 
symptoms. For example, such cell systems may be exposed to a compound suspected of exhibiting an 
ability to ameliorate disease symptoms, at a sufficient concentration and for a time sufficient to elicit 
such an ameUoration of disease symptoms in the exposed cells. After ejqposuie, the ceUs are 
examined to determme whether one or more of the disease cellular phenotypes has been altered to 
resemble a more noruGial or more wild type, non-disease phenotype. 

In addition, animal-based disease systems, such as those described herein, may be used to 
identify compounds capable of amelioratmg disease symptoms. Such animal models may be used as 
test substrates for the identification of drugs, pharmaceuticals, therapies, and interventions which may 
be effective in treating a disease or other phenotypic characteristic of the animal. For exarqple, animal 
models may be exposed to a conqpound or agent suspected of exhibiting an ability to ameliorate 
disease symptoms, at a sufficient concentration and for a time sufficient to elicit such an amelioration 
of disease syn^ptoms in the exposed animals. The response of the animals to the exposure may be 
monitored by assessing the reversal of disorders associated with the disease. Exposure may involve 
treating mother animals during gestation of the model animals described herein, tiiereby exposing 
embryos or fetuses to the compound or agent which may prevent or ameliorate the disease or 
phenotype. Neonatal, juvenile, and adult animals can also be exposed. 

More particularly, using the animal models of the mvention, specifically, transgenic mice, 
methods of identifymg agents, including compounds are provided, preferably, on the basis of the 
ability to afifisct at least one phenotype associated with a disruption in a target gene. Iq one embodi- 
ment, the present invention provides a method of identifyuig agents having an effect on target gene or 
alternatively, target protem expression or function. The method includes measuring a physiological 
response of the animal, for example, to the agent, and con^aring the physiological response of such 
anhnal to a control animal, wherein the physiological response of the animal conq)rising a disruption 



wo 01/98508 



PCT/USO 1/20015 



-19- 

in a target gene as compared to the control animal indicates the specificity of the agent. A "physiolo- 
gical response" is any biological or physical parameter of an animal which can be measured. 
Molecular assays (e.g., gene transcription, protein production and degradation rates), physical 
parameters (e.g., exercise physiology tests, measurement of various parameters of respiration, 
5 measurement of heart rate or blood pressure, measurement of bleeding tkne, aPTT.T, or TT), and 
cellular assays (e,g,,. immunohistochemical assays of ceU surface markers, or the ability of cells to 
aggregate or proliferate) can be used to assess a physiological response. The transgenic animals and 
cells of the present invention may by utilized as models for diseases, disorders, or conditions 
associated with phenotypes relating to a disruption in a target gene. 
10 The present invention provides a unique animal model for testing and developing new 

treatments relating to the behavioral phenotypes. Analysis of the behavioral phenotype allows for the 
development of an animal model useful for testing, for instance, die efBcacy of proposed genetic and 
pharmacological therapies for human genetic diseases, such as neurological, neuropsychological, or 
psychotic ilhiesses. 

15 A statistical analysis of the various behaviors measured can be carded out using any 

conventional statistical program routinely used by those skilled in the art (such as. for example, 
"Analysis of Variance" or ANOVA). A "p" value of about 0.05 or less is generally considered to be 
statistically significant, although slightly higher p values may still be indicative of statistically 
significant differences. To statistically analyze abnormal behavior, a conq)arison is made between the 

20 behavior of a transgenic animal (or a group thereof) to die behavior of a wild-type mouse (or a group 
thereof), typically under certain prescribed conditions. "Abnormal behavior" as used herein refers to 
behavior exhibited by an animal having a disruption in the target gene, e.g. transgenic animal, which 
differs fi:om an animal without a dismption in the target gene, e.g. wild-type mouse. Abnormal 
behavior consists of any number of standard behaviors that can be objectively measured (or observed) 

25 and con:q)ared. In the case of conqiarison, it is preferred fliat the change be statistically significant to 
confirm that there is indeed a meaningfiil behavioral difference between the knockout animal and the 
wild-type control animal. Exan^iles of behaviors which may be measured or observed include, but 
are not limited to, ataxia, rapid limb movement, eye movement, breathing, motor activity, cognition, 
emotional behaviors, social behaviors, hyperactivity, hypersensitivity, anxiety, in?)aiced learning, 

30 abnormal reward behavior, and abnormal social interaction, such as aggression. 

A series of tests may be used to measure the behavioral phenotype of the animal models of 
the present invention, including neurological and neuropsychological tests to identify abnormal 
behavior. These tests may be used to measure abnormal behavior relating to, for exanq)le, learning 
and memory, eating, pain, aggression, sexual reproduction, anxiety, depression, schizophrenia, and 

35 drug abuse. (See, e.g., Crawley and Paylor, Hormones and Behavior 31: 197-21 1 (1997)), 
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The social interaction test involves exposing a mouse to other animals in a variety of settings. 
The social behaviors of the animals (e.g., touching, climbing, snifOng, and mating) are subsequently 
evaluated. Differences in behaviors can then be statistically analyzed and compared {See, e.g„ S. E. 
File, et al,, Pharmacol Bioch. Behav. 22:941-944 (1985); R. R. Holson, Phys. Behav. 37:239-247 
(1986)). Examplary behavioral tests mclude flie following. 

The mouse startle response test typically involves e;q)osing the animal to a sensory (typically 
auditory) stunulus and measuring the starfle response of the animal (see, e,g„ M. A. Geyer, et al.. 
Brain Res, Bull 25:485-498 (1990); Paylor and Crawley, Psychophannacology 132:169-180 (1997)). 
A pre-pulse mhibition test can also be used, in which the percent inhibition (from a normal startle 
response) is measured by "cuemg" tiie anunal first wifli a brief low-intensity pre-pulse prior to the 
startie pulse. 

The electric shock test generally involves exposure to an electrified surface and measurement 
of subsequent behaviors such as, for example, motor activity, learning, social behaviors. The behavi- 
ors are measured and statistically analyzed usmg standard statistical tests. {See, e.g„ G. J. Kant, et al, 
Pharm. Bioch. Behav. 20:793-797 (1984); N. J. Leidenheimer, etal., Pharmacol Bioch Behav. 
30:351-355 (1988)). 

The tail-pinch or immobilization test involves applying pressure to the tail of the animal 
and/or restraining the animal's movements. Motor activity, social behavior, and cognitive behavior 
are exanq)les of die areas that are measured. (See, e.g„ M. Bertolucci D'Angic, et al, Neurochem, 
55:1208-1214 (1990)). 

The novelty test generally comprises exposure to a novel envhronment and/or novel objects. 
The animal's motor behavior in the novel environment and/or around the novel object are measured 
and statisticaUy analyzed. (See, e.g., D. K. Reinstein, etal, Phamu Bioch Behav. 17:193-202 (1982); 
B. Poucet, Behav. Neuroscl 103: 1009-10016 (1989); R. R. Holson, et al, Phys. Behav. 37:231-238 
(1986)). This test may be used to detect visual processing deficiencies or defects. 

The learned helplessness test mvolves exposure to stresses, for exanple, noxious stimuli, 
which caimot be affected by the animal's behavior. The animal's behavior can be statistically 
analyzed using various standard statistical tests. (See, e.g.,A. Leshner, etal, Behav. Neural Biol 
26:497-501 (1979)). 

Alternatively, a tail suspension test may be used, in which the 'Immobile" time of the mouse 
is measured when suspended "upside-down" by its tail. This is a measure of whethw- the animal 
struggles, an indicator of depression, Li humans, depression is beUeved to result from feelings of a 
lack of control over one's life or situation. It is beheved that a depressive state can be elicited in 
animals by repeatedly subjecting them to aversive situations over which they have no control. A 
condition of 'learned helplessness" is eventually reached, in which the ammal will stop trymg to 
change its circumstances and simply accept its fate. Animals that stop struggling sooner are believed 
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to be more prone to depression. Studies have shown that the adnimistration of certain antidepressant 
drugs prior to testing increases the amount of time that animals struggle before giving up. 

The Morris water-maze test comprises learning spatial orientations in water and subsequently 
measuring the animal's behaviors, such as, for example, by counting the number of mconect choices. 
5 The behaviors measured are statistically analyzed usmg standard statiistical tests. {See^ e.g,, E. M. 
Spruijt, etal.. Brain Res, 527:192-197 (1990)). 

Alternatively, a Y-shaped maze may be used (see, e.g., McFarland, D.J., Pharmacology, 
Biochemistry and Behavior 32:723-726 (1989); Dellu, R, et al., Neurobiology of Learning and 
Memory 73:31-48 (2000)). The Y-maze is generally believed to be a test of cognitive abiUty. The 

10 dunensions of each arm of the Y-maze can be, for example, approximately 40 cm x 8 cm x 20 cm, 

although other dknensions noay be used. Each arm can also have, for exanqile, sixteen equally spaced 
photobeams to autonoatically detect movement within the arms. At least two different tests can be 
performed using such a Y-maze. In a continuous Y-maze paradignou mice are allowed to explore all 
three arms of a Y-maze for, e.g., approximately 10 minutes. The animals are continuously tracked 

15 usmg photobeam detection grids, and the data can be used to measure spontaneous alteration and 
positive bias behavior. Spontaneous alteration refers to the natural tendency of a **normal" animal to 
visit the least familiar arm of a maze. An alternation is scored when the animal naakes two 
consecutive tunas in the same diiectioii, thus representing a sequence of visits to the least recently 
entered arm of the maze. Position bias determines egocentrically defined responses by noeasurmg the 

20 animal's tendency to favor turning in one direction over another. Therefore, the test can detect 
differences in an animal's ability to navigate on the basis of aUocentric or egocentric mechanisn^. 
The two-trial Y-maze memory test measures response to novelty and spatial memory based on a free- 
choice exploration paradigm. During the first trial (acquisition), the animals are allowed to fi:eely 
visit two aims of the Y-maze for, e.g., approximately 15 mmutes. The tiurd arm is blocked off during 

25 this trial. The second trial (retrieval) is performed after an intertrial interval of, e.g., approximately 2 
hours. During the retrieval trial, the blocked arm is opened and the animal is allowed access to all 
three arms for, e.g., approximately 5 minutes. Data are collected during the retrieval trial and 
analyzed for the number and duration of visits to each arm. Because the three arms of the maze are 
vhtually identical, discrimination between novelty and familiarity is dependent on "environmental" 

30 spatial cues around the room relative to the position of each ann. Changes in arm entry and duration 
of time spent in the novel arm in a transgenic animal model may be indicative of a role of that gene in 
mediating novelty and recognition processes. 

The passive avoidance or shuttle box test generally involves exposure to two or more 
environments, one of which is noxious, providmg a choice to be learned by the animal. Behavioral 

35 measures include, for exanqple, response latency, number of correct responses, and consistency of 
response. (See, e,g., R. Ader, et al, Psychon, ScL 26: 125-128 (1972); R. R. Holson, Phys, Behav. 
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37:221-230 (1986)). Alternatively, a zero-maze can be used. In a zero-maze, the animals can, for 
example, be placed in a closed quadrant of an elevated annular platform having, e.g., 2 open and 2 
closed quadrants, and are allowed to explore for approximately 5 minutes. This paradigm exploits an 
approach-avoidance conflict between normal exploratory activity and an aversion to open spaces in 
5 rodents. This test measures anxiety levels and can be used to evaluate the effectiveness of anti- 
amdolytic drugs. The time spent in open quadrants versus closed quadrants may be recorded 
automatically, with, for example, the placement of photobeams at each transition site. 

The food avoidance test involves exposure to novel food and objectively measuring, for 
example, food intake and intake latency. The behaviors measured arc statistically analyzed usmg 
10 standard statistical tests. {See, e.g., B. A. Campbell, et al., /. Comp. Physiol Psychol 67: 15-22 
(1969)). 

The elevated plus-maze test comprises exposiure to a maze, without sides, on a platform, the 
animal's behavior is objectively measured by counting the number of maze entries and maze learning. 
The behavior is statistically analyzed using standard statistical tests. {See, e.g., H, A. Baldwin, et al, 
15 Brain Res, Bull 20:603-606 (1988)). 

The stimulant-induced hyperactivity test involves injection of stimulant drags {e.g,, 
amphetamines, cocaine, PCP, and the like), and objectively measuring, for example, motor activity, 
social interactions, cognitive behavior. The animal's behaviors are statistically analyzed using 
standard statistical tests. {See, e.g., P. B. S. Clarke, etal., Psychopharmacology 96:511-520 (1988); P. 
20 Kuczenski, et al, J. Neuroscience 1 1:2703-2712 (1991)). 

The self-stimulation test generally comprises providing the mouse with the opportunity to 
regulate electrical and/or chemical stimuli to its own brain. Behavior is measured by frequency and 
pattern of self-stimulation. Such behaviors are statistically analyzed using standard statistical tests. 
{See, e.g., S. Nassif, et al.. Brain Res., 332:247-257 (1985); W. L. baac, et al, Behav. NeuroscL 
25 103:345-355(1989)). 

The reward test involves shaping a variety of behaviors, e.g., motor, cognitive, and social, 
measuring, for example, rapidity and reliability of behavioral change, and statistically analyzing the 
behaviors measured, {See, e.g., L. E. Jarrard, et al, Exp. Brain Res. 61:519-530 (1986)). 

The DRL (differential reinforcement to low rates of responding) performance test involves 
30 closure to intermittent reward paradigms and measuring the number of proper responses, e.g., lever 
pressing. Such behavior is statistically analyzed using standard statistical tests. {See, e,g., J. D. Sinden, 
et al, Beha^. Neurosci. 100:320-329 (1986); V, Nalwa, et al, Behav Brain Res. 17:73-76 (1985); and 
A. J. Nonneman, etal, J. Contp. Physiol Psych. 95:588-602 (1981)), 

The spatial learning test involves exposure to ^ complex novel environment, measuring the 
35 rapidity and extent of spatial learning, and statistically analyzing the behaviors measured. {See, e.g., 
N. Pitsikas, et al, Pharm. Bioch. Behav. 38:931-934 (1991); B. Poucet, et al. Brain Res. 37:269-280 
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(1990); D. Christie, et al. Brain Res. 37:263-268 (1990); and F. Van Haanen, et al., Behav, NeuroscL 
102:481-488 (1988)). Alternatively, an open-field (of) test rmy be used, in which the greater distance 
traveled for a given amoimt of time is a measure of the activity level and anxiety of the animal. When 
the open field is a novel environment, it is believed that an approach-avoidance situation is created, in 
5 which the animal is *'tom'* between the drive to explore and the drive to protect itself. Because the 
chambea- is lighted and has no places to hide other than the comers, it is expected that a **normar' 
mouse will spend more tune ui the comers and around the periphery than it will in the center where 
there is no place to hide. ''Normal" mice will, however, venture into the central regions as they 
explore more and more of the chamber. It can then be extrapolated that especially anxious mice will 

10 spend most of their time in the comers, with relatively litfle or no exploration of the central region, 
whereas bold (i.e., less anxious) mice will travel a greater distance, showing littie preference for the 
periphery versus the central region. 

The visual, somatosensory and auditory neglect tests generally comprise exposure to a 
sensory stunulus, objectively measuring, for example, orimtating responses, and statistically 

15 analyzing the behaviors measured. (See, e.g., J. M, Vargo, et al., Exp. Neurol. 102: 199-209 (1988)). 

The consummatory behavior test generally comprises feeding and drinking, and objectively 
measuring quantity of consunq)tion. The behavior measured is statistically analyzed using standard 
statistical tests. (See, e.g., P. I. Fletcher, et al., Psychopharmacol. 102:301-308 (1990); M. G. Corda, 
et al.„ Proc. Nafl Acad. Sci. USA 80:2072-2076 (1983)). 

20 A visual discrimination test can also be used to evaluate the visual processing of an animal 

One or two sumlar objects are placed in an open field and the animal is allowed to explore for about 
5-10 minutes. The tune spent exploring each object (proximity to, i.e., movement wiflun, e.g„ about 
3-5 cm of the object is considered exploration of an object) is recorded. The animal is then removed 
fiiom the open field, and the objects are replaced by a similar object and a novel object The animal is 

25 returned to the open field and the percent time spent exploring the novel object over the old object is 
measured (again, over about a 5-10 minute span). "Normal" animals will typically spend a higher 
percentage of time exploring the novel object rather than the old object. If a delay is imposed 
between sampling and testing, the memory task becomes more hippocampal-dependent. If no delay is 
iir5)0sed, the task is more based on simple visual discrimination. This test can also be used for 

30 olfactory discrimmation, in which the objects (preferably, simple blocks) can be sprayed or otherwise 
treated to hold an odor. This test can also be used to detemodne if the animal can make gustatory 
discriminations; aiumals that return to the previously eaten food instead of novel food exhibit 
gustatory neophobia. 

A hot plate analgesia test can be used to evaluate an animal's sensitivity to heat or painful 
35 stimuli. For exmsple, a mouse can be placed on an approximately 55*C hot plate and the mouse's 
response latency (e.g., time to pick up and lick a hind paw) can be recorded. These responses are not 



wo 01/98508 



PCT/USOl/20015 



-24- 

reflexes, but rather "higher" responses requiring cortical involvement. This test may be used to 
evaluate a nociceptive disorder. 

An accelerating rotarod test may be used to measure coordination and balance in noice. 
Animals can be, for exaoqple, placed on a rod that acts like a rotating treadmill (or rolling log). The 
rotarod can be made to rotate slowly at first and then progressively faster until it reaches a speed of, 
e.g., approximately 60 rpm. The mice must continually reposition themselves in order to avoid falling 
off. The animals are preferably tested in at least three trials, a minimum of 20 minutes apart. Those 
mice that are able to stay on the rod the longest are believed to have better coordination and balance. 

A metrazol administration test can be used to screen animals for varying susceptibilities to 
seizures or similar events. For exan^^le, a 5mg/ml solution of metrazol can be infused through the tail 
vem of a mouse at a rate of, e.g., approximately 0.375 ml/min. The infusion will cause all mice to 
experience seizures, followed by death. Those mice that enter the seizure stage the soonest are 
believed to be more prone to seizures. Four distinct physiological stages can be recorded: soon after 
the start of infusion, die mice will exhibit a noticeable '*twitch", followed by a series of seizures, 
ending m a final tensmg of the body known as "tonic extension", which is followed by death. 

Target Gene Products 

The present invention further contemplates use of the target gene sequence to produce target 
gene products. Target gene products may include protems that represent fimctionally equivalent gene 
products. Such an equivalent gene product may contain deletions, additions or substitutions of amino 
acid residues withm the amino acid sequence encoded by the gene sequences described herein, but 
which result in a silent change, thus producmg a functionally equivalent target gene product. Ammo 
acid substitutions may be made on the basis of similarity in polarity, charge, solubility, hydrophobic 
city, hydrophilidty, and/or the ampfaipathic nature of flie residues involved. 

For example, nonpolar (hydrophobic) amino acids mclude alanine, leucme, isoleucine, valine, 
prolme, phenylalanine, tryptophan, and methionine; polar neutral amino acids include glycme, serine, 
threonine, cysteine, tyrosine, asparagine, and glutamine; positively charged (basic) amino acids 
include arginine, lysine, and histidme; and negatively charged (acidic) ammo acids include aspartic 
acid and glutamic acid. "Functionally equivalent", as utilized herem, refers to a protein capable of 
exhibiting a substantially similar in vivo activity as the endogenous gene products encoded by the 
target gene sequences. Alternatively, when utilized as part of an assay, "functionally equivalent" may 
refer to peptides capable of interactmg with other cellular or extracellular molecules in a manner 
substantially similar to flie way in which the corresponding portion of the endogenous gene product 
would. 

Other protein products useftd according to the metfiods of flie invention are peptides derived 
from or based on tiie target gene produced by recombinant or synthetic means (derived peptides). 
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Target gene products may be produced by recombinant DNA technology using techniques 
well known in the art. Thus, methods for preparing the gene polypeptides and peptides of the 
invention by expressing nucleic acid encoding gene sequences are described herein. Methods which 
are well known to those skilled in the art can be used to construct expression vectors containing gene 
5 protein coding sequences and appropriate transcriptional/tianslational control signals. These methods 
include, for exanople, in vitro recombinant DNA techniques, synthetic techniques and in vivo 
recombmation/genetic recombination (see, e.g., Sanibrook, et al., 1989, supra, and Ausubel, et al., 
1989, supra). Alternatively, RNA capable of encoding gene protein sequences may be chemically 
synthesized using, for example, automated synthesizers (see, e.g. Oligonucleotide Synthesis: A 

10 Practical Approach, Gait, M. J. ed., IRL Press, Oxford (1984)). 

A variety of host-expression vector systems may be utilized to express the gene coding 
sequences of the invention. Such host-expression systems represent vehicles by which the coding 
sequences of interest may be produced and subsequently purified, but also represent cells which may, 
when transformed or transf ected with the appropriate nucleotide coding sequences, exhibit the gene 

15 protein of the invention in situ. These include but are not limited to microorganisms such as bacteria 
(e.g., B. coli, B. subtiHs) transformed with recombinant bacteriophage DNA, plasraid DNA or cosmid 
DNA expression vectors containing gene protein coding sequences; yeast {e.g, Saccharomyces, 
Pichia) transformed with recombinant yeast expression vectors containing the gene protein coding 
sequences; insect cell systems infected with recombinant virus expression vectors (e.g., baculovirus) 

20 containing the gene protein coding sequences; plant cell systems infected with recombinant virus 
expression vectors cauliflower mosaic virus, CaMV; tobacco mosaic virus, TMV) or trans- 
formed with recombiaaat plasmid expression vectors (e.g., Ti plasmid) containing gene protein 
coding sequences; or mammalian cell systems {e.g. COS, CHO, BHK, 293, 3T3) harboring 
recombinant e3q)ression constructs containing promoters derived from the genome of mammalian 

25 cells ie.g., metallothionine promoter) or from mammalian viruses (e.g., the adenovirus late promoter; 
the vaccinia vmis 7.5 K promoter). 

In bacterial systems, a number of expression vectors may be advantageously selected 
depending upon the use intended for the gene protein being expressed. For example, when a large 
quantity of such a protein is to be produced, for the generation of antibodies or to screen peptide 

30 libraries, for example, vectors which direct the expression of high levels of fusion protein products 
that are readily purified may be desirable. Such vectors mclude, but are not limited, to the E. coli 
expression vector pUR278 (Ruther et al., EMBO J.,- 2: 1791-94 (1983)), in which the gene protem 
coding sequence naay be ligated individually into the vector in frame with the lac Z coding region so 
that a fusion protein is produced; pIN vectors (Ihouye & hiouye. Nucleic Acids Res., 13:3101-09 

35 (1985); Van Heeke et al., J. Biol Chem., 264:5503-9 (1989)); and the like. pGEX vectors may also be 
used to ejqpress foreign polypeptides as fusion proteins with glutathione S-transferase (GST). la 
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general, such fusion proteins are soluble and can easily be purified from lysed cells by adsorption to 
glutathione-agarose beads followed by elution in the presence of free glutathione. The pGEX vectors 
are designed to include thronibm or factor Xa protease cleavage sites so that the cloned target gene 
protein can be released from the GST moiety. 
5 In a preferred embodunent, fiill length cDKA sequences are appended with in-frame Bam HI 

sites at the amino terminus and Eco RI sites at the carboxyl terminus usmg standard PCR methodo- 
logies (Bmis, et al. (eds) PCR Protocols: A Guide to Methods and Applications, Academic Press, San 
Diego (1990)) and ligated into the pGEX-2TK vector (Pharmacia, Uppsala, Sweden). The resulting 
cDNA constract contains a kinase recognition site at the amino terminus for radioactive labeling and 

10 glutathione S-transferase sequences at the carboxyl terminus for affinity purification (Nilsson, et al, 
EMBO J., 4: 1075-80 (1985); Zabeau et al, EMBO J., 1: 1217-24 (1982)). 

In an insect system, Autographa califomica nuclear polyhedrosis virus (AcNPV) is used as a 
vector to express foreign genes. The virus grows in Spodoptem fiiigiperda cells. The gene codmg 
sequence may be cloned individually into non-essential regions (for exaiiq)le the polyhedrin gene) of 

15 the vmis and placed under control of an AcNPV promoter (for exaixq>le the polyhedrin promoter). 
Successfid insertion of gene coding sequence will result in inactivation of the polyhedrin gene and 
production of non-occluded recombmant virus (i.e., virus lacking the proteinaceous coat coded for by 
the polyhedrin gene). These recombinant viruses are then used to mfect Spodoptem frugiperda cells 
in which the inserted gene is expressed (see, e.g.. Smith, et a/., /. Virol 46: 584-93 (1983); U.S. Pat. 

20 No. 4,745,051). 

In mammalian host cells, a number of viral-based expression systems may be utilized. In 
cases where an adenovirus is used as an expression vector, the gene coding sequence of interest may 
be ligated to an adenovkus transcription/translation control conq)lex, e.g., the late promoter and 
tripartite leader sequence. This chuneric gene may then be inserted in the adenovirus genome by in 

25 vitro or in vivo recombination. Insertion in a non-essential region of the viral genome {e.g,, region El 
or E3) win result in a recombmant virus that is viable and capable of expressmg gene protein ia 
infected hosts. {e.g., see Logaa etal, Proc. Natl Acad. Set USA, 81:3655-59 (1984)). Specific 
initiation signals may also be required for efficient translation of inserted gene coding sequences. 
These signals mclude the ATG initiation codon and adjacent sequences. In cases where an entire 

30 gene, mcluding its own initiation codon and adjacent sequences, is inserted into the appropriate 
expression vector, no additional translational control signals may be needed. However, in cases 
where only a portion of the gene coding sequence is inserted, exogenous translational control signals, 
includmg, perhaps, the ATG uiitiation codon, must be provided. Furthermore, the initiation codon 
must be in phase with the reading fi:ame of the desired coding sequence to ensure translation of the 

35 entke msert. These exogenous translational control signals and initiation codons can be of a variety of 
origins, both natural and synthetic. The efBciency of expression may be enhanced by the inclusion of 
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appropriate transcription enhancer elements, transcription terminators, etc. (see Bitter, etaL, Methods 
inEmymol, 153:516-44(1987)). 

In addition, a host cell strain may be chosen which modulates the expression of the inserted 
sequences, or modifies and processes the gene product in the specific fashion desired. Such 
5 modifications (e.g,, glycosylation) and processing (e.g., cleavage) of protein products may be 
in5)ortant for the function of the protein. Different host cells have characteristic and specific 
mechanisms for the post-translational processing and modification of proteius. Appropriate cell lines 
or host systems can be chosen to ensure die conect modification and processing of the foreign protein 
expressed. To this end, eukaryotic host cells which possess the cellular machinery for proper 

10 processing of the primary transcript, glycosylation, and phosphorylation of the gene product may be 
used. Such mammalian host cells include but are not limited to CHO, VERO, BHK, HeLa, COS, 
MDCK, 293, 3T3, WD8, etc. 

For long-term, high-yield production of recombinant proteins, stable expression is preferred. 
For exanq)le, cell lines which stably express the gene protein may be engineered. Rather than using 

15 expression vectors which contain viral origins of replication, host cells can be transformed with DNA 
controlled by appropriate expression control elements (e.g., promoter, enhancer, sequences, 
transcription terminators, polyadenylation sites, etc.), and a selectable marker. Following flie 
introduction of the foreign DNA, engineered cells may be allowed to grow for 1-2 days in an enriched 
media, and then are switched to a selective media. The selectable marker in the recombinant plasmid 

20 confers resistance to the selection and allows cells which stably integrate the plasmid into their 

chromosomes and grow, to form foci which in turn can be cloned and expanded into cell lines. This 
method may advantageously be used to engmeer cell lines which express the gene protein. Such 
engineered cell lines may be particularly useful in screening and evaluation of compounds that affect 
the endogenous activity of the gene protein. 

25 In a preferred embodiment, control of timing and/or quantity of expression of the recombinant 

protein can be controlled using an inducible e;q)ression construct. Inducible constructs and systems 
for inducible expression of recombinant protems will be well known to those skilled in the art 
Examples of such inducible promoters or other gene regulatory elements mclude, but are not limited 
to, tetracycline, metallothiooine, ecdysone, and other steroid-responsive promoters, rapamycin 

30 responsive promoters, and the like (No, et aU Proc, Natl Acad. Set USA, 93:3346-51 (1996); Furth, 
et aU Proc. Natl Acad. Set USA^ 91:9302-6 (1994)). Additional control elements that can be used 
include promoters requiring specific transcription factors such as viral, particularly HIV, promoters. 
In one in embodiment, a Tet inducible gene expression system is utilized. (Gossen et al, Proc, Natl 
AcadScL OTA, 89:5547-51 (1992); Gossen, a/., Science, 268: 1766-69 (W^^ Tet Expression 

35 Systems are based on two regulatory elements derived from the tetracycline-iesistance operon of the 
E, coli TnlO tcansposon — ^the tetracycline repressra: protein (TetR) and the tetracycline operator 
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sequence (tetO) to which TetR binds. Using such a system, expression of the recombmant protein is 
placed under the control of the tetO operator sequence and transfected or transformed into a host cell. 
Id the presence of TetR, which is co-transfected into the host cell, expression of the recombinant 
protein is repressed due to binding of the TetR protein to the tetO regulatory element. High-level, 
5 regulated gene expression can then be induced in response to varying concentrations of tetracycline 
(Tc) or Tc derivatives such as doxycycline (Dox), which compete with tetO elements for binding to 
TetR. Constructs and materials for tet inducible gene expression are available commercially from 
CLONTECH Laboratories, Inc., Palo Alto, CA. 

When used as a component in an assay system, the gene protein may be labeled, either 

10 directly or mdirectly, to facilitate detection of a complex formed between the gene protein and a test 
substance. Any of a variety of suitable labeling systems may be used including but not limited to 
radioisotopes such as 1251; enzyme labeling systems that generate a detectable calorimetric signal or 
light when exposed to substrate; and fluorescent labels. Where recombinant DNA technology is used 
to produce the gene protein for such assay systems, it may be advantageous to engineer fusion 

1 5 proteins that can facilitate labeling, immobilization and/or detection. 

Indirect labelmg involves the use of a protein, such as a labeled antibody, which specifically 
binds to the gene product. Such antibodies include but are not limited to polyclonal, monoclonal, 
chimeric, single chain, Fab fragments and fragments produced by a Fab expression library. 
Production of Antibodies 

20 Described herein are methods for the production of antibodies capable of specijBcally 

recognizkig one or more epitopes. Such antibodies may include, but are not limited to polyclonal 
antibodies, monoclonal antibodies (mAbs), humanized or chimeric antibodies, single chain antibodies. 
Fab fragments, F(ab')2 fragments, fragments produced by a Fab expression library, anti-idiotypic 
(anti-Id) antibodies, and epitope-binding fragments of any of the above. Such antibodies may be used, 

25 for example, in the detection of a target gene m a biological sample, or, altematively, as a method for 
the inhibition of abnormal target gene activity. Thus, such antibodies may be utilized as part of 
disease treatment methods, and/or may be used as part of diagnostic techniques whereby patients may 
be tested for abnonnal levels of target gene proteins, or for the presence of abnormal forms of such 
proteins. 

30 For the production of antibodies, various host animals may be inununized by injection with 

the target gene, its expression product or a portion thereof. Such host animals may include but aie not 
limited to rabbits, mice, and rats, to name but a few. Various adjuvants may be used to increase the 
immunological response, depending on the host species, including but not limited to Freund's 
(complete and incoicplete), mineral gels such as aluminum hydroxide, surface active substances such 

35 as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole liiiq)et hemocyanin. 
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dimtrophenol, and potentially useful human adjuvants such as BCG {bacille Calmette-Guerin) and 
Corynebacterium parvum. 

Polyclonal antibodies are heterogeneous populations of antibody molecules derived from the 
sera of animals immunized with an antigen, such as target gene product, or an antigenic functional 
derivative tiieieof. For flie production of polyclonal antibodies, host animals such as fliose described 
above, may be immunized by mjection with gene product supplemented with adjuvants as also 
described above. 

Monoclonal antibodies, which are homogeneous populations of antibodies to a particular 
antigen, may be obtained by any technique which provides for the production of antibody molecules 
by continuous cell Imes in culture. These mclude, but are not limited to the hybridoma technique of 
K6hler and Mflstem, Nature, 256:495-7 (1975); and U.S. Pat. No. 4,376,110), the human B-cell 
hybridoma technique (Kosbor, et aU Immunology Today, 4:72 (1983); Cote, et a/., Proc. Natl Acad. 
ScL USA, 80:2026-30 (1983)), and the EBV-hyhridoma technique (Cole, et al., in Monoclonal 
Antibodies And Cancer Therapy, Alan R. liss, Inc., New York, pp. 77-96 (1985)). Such antibodies 
may be of any unmunoglobulm class including IgG, IgM, I^, IgA, IgD and any subclass thereof. The 
hybridoma producing ttie mAb of this invention may be cultivated in vitro or m vivo. Production of 
high titers of mAbs m vivo makes this die presently preferred method of production. 

In addition, techniques developed for die production of "chimeric antibodies'* (Morrison, et 
aU Proc, Natl Acad. ScL, 81:6851-6855 (1984); Takeda, etal. Nature, 314:452-54 (1985)) by 
splicmg the genes from a mouse antibody molecule of appropriate antigen specificity together with 
genes from a human antibody molecule of appropriate biological activity can be used. A chimeric 
antibody is a molecule in which different portions are derived from different animal species, such as 
those having a variable region derived froma murme mAb and a human immunoglobulin constant 
region. 

Alternatively, techniques described for die production of single chain antibodies (U.S. Pat. 
No. 4,946,778; Bird, Science 242:423-26 (1988); Huston, etal, Proc. Natl Acad. ScL USA, 85:5879- 
83 (1988); and Ward, et al. Nature, 334:544^ (1989)) can be adapted to produce gene-smgle cham 
antibodies. Single chain antibodies are formed by Unking the heavy and light chain fragments of the 
Fv region via an amino acid bridge, resultmg in a single chain polypeptide. 

Antibody fragments which recognize specific epitopes may be generated by known tech- 
niques. For example, such fragments include but are not limited to: the F(ab')2 fragments which can 
be produced by pepsm digestion of the antibody molecule and the Fab fragments which can be 
generated by reducing die disulfide bridges of die F(ab')2 fragments. Alternatively, Fab expression 
libraries may be constmcted (Huse, etal. Science, 246:1275-81 (1989)) to aUow rapid and easy 
identification of monoclonal Fab fragments with the desired specificity. 
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Screening Methods 

The present inventioii may be en^loyed in a process for screening for agents such as agonists, 
Le, agents that bind to and activate target gene polypeptides, or antagonists. i.e, inhibit the activity or 
interaction of target gene polypeptides with its ligand. Thus, polypeptides of the invention may also 
5 be used to assess the binding of small molecule substrates and ligands in, for exaii^)le, cells, cell-free 
preparations, chemical libraries, and natural product mixtures as known in the art. Any methods 
routinely used to identify and screen for agents that can modulate receptors may be used in 
accordance with the present invention. 

The present invention provides methods for identifymg and screening for agents that 

10 inodulate target gene ejq)ression or function. More particularly, cells that contain and ^xpitss target 
gene sequences may be used to screen for therapeutic agents. Such cells may include non- 
recombmant monocyte cell Unes, such as U937 (ATCC# CRL-1593), THP-1 (ATCC# TIB-2Q2), and 
P388D1 {ATCC# TIB-63); endotheUal cells such as HUVECs and bovine aortic endotheUal cells 
(BAECs); as well as generic mammalian cell Unes such as HeLa cells and COS cells, e.g., COS-7 

15 (ATCC#CRL-1651). Further, such cells may include recombinant, transgenic cell lines. For 

example, the transgenic mice of the invention may be used to generate cell lines, containing one or 
more cell types involved in a disease, that can be used as cell culture models for that disorder. While 
cells, tissues, and primary cultures derived from the disease transgenic animals of the invention may 
be utUized, the genemtion of continuous cell lines is preferred. For examples of techniques which 

20 may be used to derive a continuous cell Ime from the transgenic ammals, see Small, et al, Mol Cell 
Biol, 5:642-48 (1985), 

Target gene sequences may be introduced into, and overexpressed in, the genome of the cell 
of interest. In order to overexpress a target gene sequence, the coding portion of the target gene 
sequence may be ligated to a regulatory sequence which is capable of driving gene expression in the 

25 cell type of mterest. Such regulatory regions will be well known to those of skill m the art, and may 
be utilized in the absence of undue experimentation. Target gene sequences may also be disnq)ted or 
xmdeiexpiessed. Cells having target gene disruptions or underexpressed target gene sequences may 
be used, for example, to screen for agents capable of affecting alternative pathways which con^nsate 
for any loss of function attributable to the disruption or underexpression, 

30 In vitro systems may be designed to identify compounds capable of binding the target gene 

products. Such compounds may include, but are not limited to, peptides made of D-and/or L-configu- 
ration amino acids (in, for example, the form of random peptide libraries; see e.g., Lam, et aU Nature, 
354:82-4 (1991)), phosphopeptides (in, for example, the form of random or partially degenerate, 
dhected phosphopeptide hbraries; see, e.g., Songyang, et a/.. Cell 72:767-78 (1993)), antibodies, and 

35 small organic or morganic molecules. Compounds identified may be useful, for exaiqple, in modu- 
lating the activity of target gene proteins, preferably mutant target gene protems; elaborating the 
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biological function of the target gene protein; or screening for conopounds that disrupt normal target 
gene interactions or themselves disrupt such mteractions. 

The principle of the assays used to identify confounds that bind to the target gene protein 
involves preparing a reaction mixture of the target gene protein and the test con5)ound under 
5 conditions and for a time sufficient to allow the two components to interact and bmd, thus forming a 
con5)lex which can be removed and/or detected in the reaction mixture. These assays can be 
conducted in a variety of ways. For example, one method to conduct such an assay would involve 
anchoring the target gene protein or the test substance onto a solid phase and detecting target protein/ 
test substance complexes anchored on the solid phase at the end of the reaction. In one embodinaent 
10 of such a method, the target gene protein may be anchored onto a solid surface, and the test 
con^jound, which is not anchored, may be labeled, either directly or indirectly. 

In practice, microtitre plates are convenientiy utilized. The anchored component may be 
immobilized by non-covalent or covalent attachments. Non-covalent attachment may be accom- 
plished singly by coating the solid surface with a solution of the protein and drymg. Altematively, an 
15 immobilized antibody, preferably a monoclonal antibody, specific for the protein may be used to 
anchor the protein to the solid surface. The surfaces may be prepared in advance and stored. 

In order to conduct the assay, the nonimmobiUzed component is added to the coated surface 
containing the anchored component. After the reaction is con^)lete, unreacted components are 
removed (e.g., by washing) under conditions such that any complexes formed will remain 
20 umnobilized on the solid surface. The detection of complexes anchored on flie solid surface can be 
accorcplished m a number of ways. Where the previously nonimmobilized conqionent is pre-labeled, 
the detection of label umnobilized on the surface indicates that complexes were formed. Where the 
previously nommmobilized component is not pre-labeled, an indirect label can be used to detect 
con^lexes anchored on the surface; e.g., usmg a labeled antibody specific for the previously nonim- 
25 mobilized con5)onent (ttie antibody, m turn, may be directly labeled or mdkectiy labeled with a 
labeled anti-Ig antibody). 

Altematively, a reaction can be conducted in a liquid phase, the reaction products separated 
fixim unreacted conq)onents, and conq)lexes detected; e.g., using an unmobilized antibody specific for 
target gene product or the test cotrpound to anchor any complexes formed in solution, and a labeled 
30 antibody specific for the other con5)onent of the possible complex to detect anchored complexes. 

Confounds that are shown to bind to a particular target gene product through one of the 
methods described above can be further tested for fliek ability to elicit a biochemical response from 
the target gene protem. Agonists, antagonists and/or inhibitors of the expression product can be 
identified utilizing assays well known in the art. 
35 Antisense, Ribozvmes> and Antibodies 
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Otber agents which may be used as therapeutics include the target gene, its expression 
product(s) and functional fragments thereof. Additionally, agents which reduce or inhibit mutant 
target gene activity may be used to ameliorate disease symptoms. Such agents include antisense, 
ribozyme, and triple helix molecules. Techniques for the production and use of such molecules are 
5 well known to those of skill in the art. 

Anti-sense RNA and DNA molecules act to directly block the translation of mRNA by 
hybridizing to targeted mRNA and preventing protein translation. With respect to antisense DNA, 
oligodeoxyribonucleotides derived from the translation initiation site, e.g., between the -10 and +10 
regions of the target gene nucleotide sequence of interest, are preferred. 

10 Ribozymes are enzymatic RNA molecules capable of catalyzmg the specific cleavage of 

RNA. The mechanism of ribozynae action involves sequence-specific hybridization of the ribozyme 
molecule to complementary target RNA, followed by an endonucleolytic cleavage. The composition 
of ribozyme molecules must include one or more sequences complementary to the target gene mRNA, 
and must include the well known catalytic sequence responsible for mRNA cleavage. For this* 

1 5 sequence, see US. Pat No. 5,093,246, which is incorporated by reference herein in its entirety. As 
such within the scope of the invention are engineered hammerhead motif ribozyme molecules that 
specifically and efficiently catalyze endonucleolytic cleavage of EINA sequences encoding target gene 
proteins. 

Specific ribozyme cleavage sites within any potential RNA target are initially identified by 
20 scanning the molecule of interest for ribozyme cleavage sites which include the following sequences, 
GUA, GUU and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides 
correspondmg to the region of the target gene contaioing the cleavage site may be evaluated for 
predicted structural features, such as secondary structure, that may render the oligonucleotide 
sequence unsuitable. The suitability of candidate sequences may also be evaluated by testing their 
25 accessibility to hybridization with complementary oligonucleotides, using ribonuclease protection 
assays. 

Nucleic acid molecules to be used in triple helix formation for the inhibition of transcription 
should be smgle stranded and composed of deoxyribonucleotides. The base composition of these 
oligonucleotides must be designed to promote triple helix formation via Hoogsteen base pairing rules, 

30 which generally require sizeable stretches of either purines or pyiimidmes to be present on one strand 
of a duplex. Nucleotide sequences may be pyrimidine-based, which will result in TAT and CGC 
triplets across the three associated strands of the resulting triple helix. The pyrimidine-rich molecules 
provide base con^^lementarity to a purine-rich region of a single strand of the duplex in a parallel 
orientation to tiiat strand. In addition, nucleic acid molecules may be chosen that are purine-rich, for 

35 example, containing a stretch of G residues. These molecules will form a triple helix with a DNA 
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duplex that is rich in GC pairs, in which the majority of the purme residues are located on a single 
strand of the targeted duplex, resulting in GGC triplets across the three strands in the triplex. 

Alternatively, the potential sequences that can be tai^eted for triple helix formation may be 
mcreased by creating a so called "switchback" nucleic acid molecule. Switchback molecules are 
synthesized in an altemathig 5 -*3', 3-5' manner, such that they base pan* with first one strand of a 
duplex and then the other, eliminating the necessity for a sizeable stretch of either purines or 
pyrimidmes to be present on one strand of a duplex. 

It is possible that the antisense, ribozyme, and/or triple helix molecules described herein may 
reduce or inhibit the transcription (triple helix) and/or translation (antisense, ribozyme) of mRNA 
produced by botii normal and mutant target gene alleles. In order to ensure that substantially normal 
levels of target gene activity are maintamed, nucleic acid molecules tiiat encode and express target 
gene polypeptides exhibiting normal activity may be introduced into cells that do not contain 
sequences susceptible to whatever antisense, ribozyme, or triple helix treatments are being utilized. 
Alternatively, it may be preferable to coadminister normal target gene protein into die cell or tissue in 
order to maintain the requisite level of cellular or tissue target gene activity. 

Anti-sense RNA and DNA, ribozyme, and triple helix molecules of the invention may be 
prepared by any mefliod known m the art for die synthesis of DNA and RNA molecules. These 
include techniques for chemically synthesizing oligodeoxyribonucleotides and oligoribonucleotides 
well known in the art such as for example solid phase phosphoramidite chemical synthesis. 
Alternatively, RNA molecules may be generated by m vitro and in vivo transcription of DNA 
sequences encoding the antisense RNA molecule. Such DNA sequences may be incorporated into a 
wide variety of vectors which incorporate suitable RNA polymerase promoters such as tiie T7 or SP6 
polymerase promoters. Alternatively, antisense cDNA constmcts that synfliesize antisense RNA 
constitutively or mducibly, dependmg on the promoter used, can be introduced stably into cell lines. 

Various well-known modifications to tiie DNA molecules may be introduced as a means of 
increasing intracellular stability and half-life. Possible modifications include but are not limited to flie 
addition of flanking sequences of ribonucleotides or deoxyribonucleotides to the 5' and/or 3' ends of 
flie molecule or the use of phosphorothioate or 2' 0-methyl ratiier than phosphodiesterase linkages 
within the oligodeoxyribonucleotide backbone. 

Antibodies that are botii specific for target gene protein, and in particular, mutant gene 
protein, and mterfere with its activity imy be used to inhibit mutant target gene function. Such 
antibodies may be generated against the protems fliemselves or against peptides coirespondmg to 
portions of the proteins using standard techniques known m the art and as also described herem. Such 
antibodies include but are not Umited to polyclonal, monoclonal. Fab firagments, single chain 
antibodies, chimeric antibodies, etc. 
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la instances where the target gene protein is intracellular and whole antibodies are used, 
internalizing antibodies may be preferred. However, lipofectin liposomes may be used to deliver the 
antibody or a fragment of the Fab region which binds to the target gene epitope into cells. Where 
j&ragments of the antibody are used, the smallest inhibitory fragment which binds to the target or 
5 expanded target protein's binding domain is preferred. For example, peptides having an amino acid 
sequence corresponding to the domain of the variable region of the antibody that binds to the target 
gene protein may be used. Such peptides may be synthesized chemically or produced via 
recombinant DNA technology using methods well known m the art (see, e.g,j Creighton, Proteins: 
Structiffes and Molecular Principles (1984) W.H. Freeman, New York 1983, supra; and Sambrook, et 

10 a/., 1989, supra). Alternatively, smgle cham neutraUzmg antibodies which bind to intracellular target 
gene epitopes may also be administered. Such smgle chain antibodies may be adininistered, for 
exani^le, by expressing nucleotide sequences encoding smgle-chain antibodies within the target cell 
population by utilizing, for exan5)le, techniques such as those described in Marasco, et aU Proc. Natl 
Acad, Scl £/SA, 90:7889-93 (1993). 

1 5 RNA sequences encoding TARGET gene protein may be dkectiy administered to a patient 

exhibiting disease symptoms, at a concentration sufficient to produce a level of target gene protein 
such that disease symptoms are ameliorated. Patients may be treated by gene replacement therapy. 
One or more copies of a normal target gene, or a portion of the gene that directs the production of a 
normal target gene protein with target gene function, may be inserted into cells usmg vectors which 

20 mclude, but are not limited to adenovhus, adeno-associated vmis, and retrovirus vectors, in addition 
to other particles that mtroduce DNA into cells, such as liposomes. Additionally, techniques such as 
those described above may be utilized for the introduction of normal target gene sequences into 
human cells. 

Cells, preferably, autologous cells, containing normal target gene e^tpressing gene sequences 
25 may then be introduced or reintroduced into the patient at positions which allow for the amelioration 
of disease synoptoms. 

Pharmaceutical Compositions, Effective Dosages, and Routes nf Administrflrirm 
The identified con^KJunds that uihibit target mutant gene expression, synthesis and/or activity 
can be administered to a patient at therapeutically effective doses to treat or ameliorate the disease. A 
30 therapeutically effective dose refers to that amount of the compound sufficient to result in 
amelioration of sjmoptoms of the disease. 

Toxicity and therapeutic efficacy of such compounds can be determined by standard pharma- 
ceutical procedures in cell cultures or experimental anunals, e.g., for determkdng the LD50 (flie dose 
lethal to 50% of the population) and the ED50 (tiie dose flierapeutically effective in 50% of the 
35 population). The dose ratio between toxic and therapeutic eflEects is the therapeutic mdex and it can 
be expressed as the ratio LD50/ED50. Compounds which exhibit large therapeutic indices are preferred. 
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WhUe conqjounds that exhibit toxic side effects may be used, care should be taken to design a 
delivery system that targets such compounds to the site of affected tissue in order to nunimize 
potential damage to uninfected cells and, thereby, reduce side effects. 

The data obtained from the ceU culture assays and animal studies can be used in formulating a 
5 range of dosage for use in humans. The dosage of such conq)ounds lies preferably within a range of 
circulating concentrations that include the ED50 with little or no toxicity. The dosage may vary within 
ftiis range depending upon the dosage form employed and the route of adimnistration utilized. For 
any conq)ound used in the method of the mvention, the therapeutically effective dose can be estunated 
initially from cell culture assays. A dose may be formulated in aninaal models to achieve a circulating 

10 plasma concentration range that includes the IC50 (ie., the concentration of the test compound which 
achieves a half-maximal inhibition of symptoms) as determined in cell culture. Such information can 
be used to more accurately detecmme useful doses in humans. Levels in plasma naay be measured, for 
exan^le, by high performance liquid chromatography. 

Pharmaceutical compositions for use in accordance with the present invention may be 

1 5 formulated in conventional manner using one or more physiologically acceptable carriers or 

excipients. Thus, the conq)ounds and ttieir physiologically acceptable salts and solvates may be 
formulated for administration by mhalation or insufflation (either through the mouth or the nose) or 
oral, buccal, parenteral, topical, subcutaneous, intraperitoneal, intravenous, intrapleural, intraoccular, 
intraarterial, or rectal administration. It is also contemplated that pharmaceutical compositions may 

20 be administered with other products that potentiate the activity of the compound and optionally, may 
include other therapeutic ingredients. 

For oral administration, the pharmaceutical compositions may take the form of, for exanqple, 
tablets or capsules prepared by conventional means with pharmaceutically acceptable excipients such 
as bmding agents (e.g., pregelatinized maize starch, polyvinylpyrrolidone or hydroxypropyl 

25 methylcellulose); fillers (e.g., lactose, microcrystalline cellulose or calcium hydrogen phosphate); 
lubricants (e.g., magnesium stearate, talc or silica); disiategrants (e.^„ potato starch or sodiimi starch 
glycolate); or wetting agents sodium lauryl sulphate). The tablets may be coated by methods 
well known m the art. Liquid preparations for oral administration may take the fonn of, for cxmnph, 
solutions, syrups or suspensions, or they may be presented as a dry product for constitution with water 

30 or other suitable vehicle before use. Such liquid pr^arations may be prepared by conventional means 
with pharmaceutically acceptable additives such as suspendmg agents (e.g., sorbitol syrup, cellulose 
derivatives or hydrogenated edible fats); emulsifying agents (e,g„ lecithin or acacia); non-aqueous 
vehicles almond oil, oily esters, ethyl alcohol or fractionated vegetable oils); and preservatives 
(e,g., methyl or propyl-p-hydroxybenzoates or sorbic acid). The preparations may also contain buffer 

35 salts, flavoring, coloring and sweetening agents as appropriate. 
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Preparations for oral administratioa may be suitably formulated to give controlled release of 
the active con:5)ound. 

For buccal administration the compositions may take the form of tablets or lozenges 
formulated in conventional manner. 
5 For adrainistration by inhalation, the compounds for use according to the present invention 

are conveniently delivered in the form of an aerosol spray presentation from pressurized packs or a 
nebulizer, with the use of a suitable propellant, e,g,, dichlorodifluoromethane, trichlorofluoromethane, 
dichlorotetrafluoroethane, carbon dioxide or other suitable gas. In the case of a pressurized aerosol 
the dosage unit may be detennrned by providing a valve to deliver a metered amoimt. Capsules and 
10 cartridges of e,g, gelatin for use m an inhaler or insufflator may be formulated containing a powder 
mix of the conqpound and a suitable powder base such as lactose or starch. 

The compounds may be formulated for parenteral administration by injection, e.^., by bolus 
mjection or contuiuous mfusion. Foimulations for injection may be presented in unit dosage form, 
m ampoules or in multi-dose containers, with an added preservative. The compositions may take 
15 such forms as suspensions, solutions or emulsions in oily or aqueous vehicles, and may contain 

formulatory agents such as suspending, stabilizing and/or dispersing agents. Alternatively, the active 
ingredient may be in powder form for constitution with a suitable vehicle, e.g., sterile pyrogen-firee 
water, before use. 

The compounds may also be formulated in rectal compositions such as suppositories or 

20 retention enemas, e.g., containing conventional suppository bases such as cocoa butter or other 

glycerides. Oral ingestion is possibly the easiest method of taking any medication. Such a route of 
administration, is generally simple and straightforward and is frequentiy the least inconvenient or 
unpleasant route of administration from the patient's point of view. However, this involves passing 
the material through the stomach, which is a hostile environment for many materials, including 

25 proteins and other biologically active compositions. As the acidic, hydrolytic and proteolytic 

environment of the stomach has evolved efficientiy to digest proteinaceous materials into amino acids 
and oligopeptides for subsequent anabolism, it is hardly surprising that very littie or any of a wide 
variety of biologically active proteinaceous material, if simply taken orally, would survive its passage 
through flie stomach to be taten up by the body in the small intestine. The result, is that many 

30 proteinaceous medicaments must be taken in through another method, such as parenterally, often by 
subcutaneous, intramuscular or intravenous injection. 

Pharmaceutical con?>ositions may also include various buffers Tris, acetate, phosphate), 
solubilizers Tween, Polysorbate), carriers such as human serum albumin, preservatives 
(tbimerosol, benzyl alcohol) and anti-oxidants such as ascorbic acid m order to stabilize 

35 pharmaceutical activity. The stabilizmg agent noay be a detergent, such as tween-20, tween-80, NP- 
40 or Triton X-100.. EBP may also be incorporated into particulate preparations of polymeric 
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coii5)oimds for controlled delivery to a patient over an extended period of tune. A more extensive 
survey of components in pharmaceutical conq)ositions is found in Remington's Pharmaceutical 
Sciences, 18th ed., A. R, Gennaro, ed., MackPublishrog, Easton, Pa. (1990). 

In addition to the formulations described previously, the compounds may also be formulated 
5 as a depot preparation. Such long acting formulations may be administered by ur5)lantation (for 
exan^le subcutaneously or intramuscularly) or by intramuscular injection. Thus, for example, the 
con?)ounds may be formulated with suitable polymeric or hydrophobic materials (for example as an 
emulsion in an acceptable oil) or ion exchange resins, or as sparingly soluble derivatives, for example, 
as a sparingly soluble salt. 
10 The compositions may, if desired, be presented in a pack or dispenser device which may 

contain one or more uidt dosage forms contaming the active ingredient. The pack may for example 
comprise metal or plastic fofl, such as a blister pack. The pack or dispenser device may be 
accon:q)anied by instructions for administrntion. 

Diagnostics 

15 A variety of methods may be employed to diagnose disease conditions associated with the 

target gene. Specifically, reagents may be used, for example, for the detection of flie presence of 
target gene mutations, or the detection of either over or under expression of target gene mRNA, 

According to the diagnostic and prognostic method of the present invention, alteration of the 
wild-type target gene locus is detected. In addition, the method can be parformed by detecting the 

20 wild-type target gene locus and confirming the lack of a predisposition or neoplasia, "Alteration of a 
wild-type gene'' encompasses all fbnus of mutations including deletions, insertions and point 
mutations m the codmg and noncoding regions. Deletions may be of the entire gene or only a portion 
of the gene. Point mutations may result in stop codons, fi:ameshift mutations or amino acid 
substitutions. Somatic mutations axe those which occur only in certain tissues, eg., in the tumor 

25 tissue, and are not mherited in the germlme. Gennline mutations can be found in any of a body's 
tissues and are inherited. If only a single allele is somatically mutated, an eariy neoplastic state is 
indicated. However, if both alleles are mutated, then a late neoplastic state may be mdicated. The 
finding of gene mutations thus provides both diagnostic and prognostic information. A target gene 
allele which is not deleted (e.g., that found on the sister chromosome to a chromosome carrying a 

30 target gene deletion) can be screened for other mutations, such as insertions, small deletions, and 

point mutations. Mutations foimd in tumor tissues may be linked to decreased expression of the target 
gene product. However, mutations leading to non-functional gene products may also be linked to a 
cancerous state. Point mutational events may occur m regulatory regions, such as in the promoter of 
the gene, leading to loss or duninution of expression of the mRNA. Point mutations may also abolish 

35 proper RNA processkg, leading to loss of expression of the target gene product, or a decrease in 
niRNA stability or translation efficiency. 
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One test available for detecting mutations in a candidate locus is to directly connate genomic 
target sequences from cancer patients with those from a control population. Alternatively, one could 
sequence messenger RNA after amplification, e.g., by PGR, thereby eliminating the necessity of 
determining tiie exon structure of the candidate gene. Mutations from cancer patients falling outside 
5 the coding region of the target gene can be detected by examining the non-coding regions, such as 
mtrons and regulatory sequences near or within the target gene. An early indication that mutations in 
noncoding regions are in5)ortant may come from Northern blot experiments that reveal messenger 
RNA molecules of abnormal size or abundance in cancer patients as compared to control individuals. 
The methods described herein may be performed, for example, by utilizing pre-packaged 

10 diagnostic kits comprising at least one specific gene nucleic acid or anti-gene antibody reagent 
described ha:ein, which may be conveniently used, €,g,, in clinical settings, to diagnose patients 
exhibiting disease synq)toms or at risk for developing disease. 

Any cell type or tissue, preferably monocytes, endothelial cells, dir smooth muscle cells, in 
which the gaie is expressed may be utiUzed in the diagnostics described below. 

15 DNA or RNA from the cell type or tissue to be analyzed may easily be isolated using 

procedures which are well known to those in the art. Diagnostic procedures may also be performed in 
situ direcfly upon tissue sections (fixed and/or frozen) of patient tissue obtained from biopsies or 
resections, such that no nucleic acid purification is necessary. Nucleic acid reagents may be used as 
probes and/or primers for such in situ procedures (see, for example, Nuovo, PGR In Situ 

20 Hybridization: Protocols and Applications, Raven Press, N.Y. (1992)). 

Gene nucleotide sequences, either RNA or DNA, may, for example, be used in hybridization 
or amplification assays of biological sanq)les to detect disease-related gene structures and expression. 
Such assays may include, but are not limited to, Southern or Northern analyses, restriction fragment 
length polymorphism assays, single stranded conformational polymorphism analyses, in situ 

25 hybridization assays, and polymerase chain reaction analyses. Such analyses may reveal both 
quantitative aspects of the expression pattern of the gene, and qualitative aspects of the gene 
expression and/or gene coiiq)osition. That is, such aspects may include, for example, point mutations, 
insertions, deletions, chromosomal rearrangements, and/or activation or inactivation of gene 
expression, 

30 Preferred diagnostic methods for the detection of gene-specific nucleic acid molecules may 

involve for example, contacting and incubating nucleic acids, derived from the cell type or tissue 
being analyzed, with one or more labeled nucleic acid reagents under conditions favorable for the 
specific annealing of these reagents to thek complementary sequences withm the nucleic acid 
molecule of interest. Preferably, the lengths of these nucleic acid reagents are at least 9 to 30 

35 nucleotides. After incubation, all non-annealed nucleic acids are removed from the nucleic 

acidrfingerprint molecule hybrid. The presence of nucleic acids fcom the fingerprint tissue which 
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have hybridized, if any such molecules exist, is thea detected. Using such a detection scheme, the 
nucleic acid from the tissue or cell type of interest may be immobilized, for example, to a solid 
support such as a membrane, or a plastic surface such as that on a microtitre plate or polystyrene 
beads. Iq this case, after incubation, non-annealed, labeled nucleic acid reagents are easily removed. 
5 Detection of the remaining, annealed, labeled nucleic acid reagents is accoirq>lished usiog standard 
techniques well-known to those in the art. 

Alternative diagnostic methods for the detection of gene-specific nucleic acid molecules may 
involve then: amplification, e.g., by PGR (the experimental embodiment set forth m MuUis U.S. Pat. 
No. 4,683,202 (1987)), ligase chain reaction (Barany, Proc. Natl Acad, ScL USA, 88:189-93 (1991)), 

10 self sustained sequence replication (Guatelli, et al, Proc. Natl Acad Sci. USA, 87:1874-78 (1990)), 
transcriptional amplification system (Kwoh, etd., Proc. Natl Acad Sci. C/SA, 86:1173-77 (1989)), 
Q-Beta Replicase (Lizardi et al., Bio/Technology, 6: 1 197 (1988)), or any other nucleic acid 
amplification method, followed by the detection of the amplified molecules using teclmiques well 
known to those of skill in the art. These detection schemes are especially useful for the detection of 

15 nucleic acid molecules if such molecules are present in very low numbers. 

In one embodiment of such a detection scheme, a cDNA molecule is obtained from an RNA 
molecule of mterest (e.g., by reverse transcription of the RNA molecule into cDNA). Cell types or 
tissues fiom which such RNA may be isolated include any tissue in which wild type fingerprint gene 
is known to be expressed, including, but not limited, to monocytes, endothelium, and/or smooth 

20 . muscle. A sequence withm the cDNA is then used as the ten^late for a nucleic acid amplification 
reaction, such as a PGR amplification reaction, or the like. The nucleic acid reagents used as 
synthesis initiation reagents (e.g., primers) in the reverse transcription and nucleic acid aii5)lificatiQn 
steps of this method may be chosen from among the gene nucleic acid reagents described herein. The 
preferred lengflis of such nucleic acid reagents are at least 15-30 nucleotides. For detection of die 

25 amplified product; the nucleic acid anqjlification may be performed using ladioactively or non- 

radioactively labeled nucleotides. Alternatively, enough aniplified product may be made such that the 
product may be visualized by standard ethidium bromide staining or by utiMng any other suitable 
nucleic acid staining method. 

Antibodies directed against wild type or mutant gene peptides may also be used as disease 

30 diagnostics and prognostics. Such diagnostic methods, may be used to detect abnormalities in the 

level of gene protein expression, or abnormalities in the structure and/or tissue, cellular, or subcellular 
location of fingerprint gene protein. Structural differences may include, for example, differences in 
the size, electronegativity, or antigenicity of the mutant fingerprint gene protein relative to the normal 
fingerprint gene protein. 

35 Protem from the tissue or cell type to be analyzed may easily be detected or isolated usmg 

techniques which are well known to those of skill in die art, mcludmg but not limited to western blot 
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analysis. For a detailed explanation of methods for carrying out western blot analysis. (SeCj e.g, 
Sambrook, et aL (1989) supra^ at Chapter 18). The protein detection and isolation methods employed 
herein may also be such as those described in Harlow and Lane, for exanqile, (Antibodies: A * 
Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York (1988)). 
5 Preferred diagnostic methods for the detection of wild type or mutant gene peptide molecules 

may involve, for exan^le, immunoassays wherein fingerprint gene peptides are detected by their 
interaction with an anti-fingerprint gene-specific peptide antibody. 

For example, antibodies, or fragments of antibodies useful in the present invention may be 
used to quantitatively or quaUtatively detect the presence of wild type or mutant gene peptides. This 
10 can be acconq)lished, for exanq)le, by immunofluorescence techniques employing a fluorescendy 
labeled antibody (see below) coupled wifli light microscopic, flow cytometric, or fluarimetric 
detection. Such techniques are especially preferred if die fingerprint gene peptides are expressed on 
the cell surface. 

The antibodies (or fragments thereof) useful in the present invention may, additionally^ be 
15 employed histologically, as m immunofluorescence or immunoelectron microscopy, for in situ 
detection of fingerprint gene peptides. In situ detection may be accon5)lished by removing a 
histological specimen from a patient, and applying thereto a labeled antibody of the present invention. 
The antibody (or fi:agment) is preferably applied by overlaying the labeled antibody (or fiagment) 
onto a biological sample. Through the use of such a procedure, it is possible to determine not only the 
20 presence of the fingeiprint gene pqptides, but also their distribution in the examined tissue. Usuig the 
present invention, those of ordinary skill will readily perceive that any of a wide variety of 
histological methods (such as staining procedures) can be modified in order to achieve such in situ 
detection. 

Immunoassays for wild type, mutant, or expanded fingerprint gene peptides typically 
25 comprise incubating a biological sanjple, such as a biological fluid, a tissue extract, fieshly harvested 
cells, or cells which have been incubated in tissue culture, m the presence of a detectably labeled 
antibody capable of identifymg fingerprint gene peptides, and detecting the bound antibody by any of 
a number of techniques well known in the art. 

The biological san:q)Ie may be brought in contact with and immobilized onto a solid phase 
30 support or carrier such as nitrocellulose, or other solid support which is capable of unmobilizing ceUs, 
cell particles or soluble proteuis. The support may then be washed with suitable buffers followed by 
treatment with the detectably labeled gene-specific antibody. The solid phase support may then be 
washed with the buffer a second time to remove unbound antibody. The amount of bound label on 
solid support may then be detected by conventional means. 
35 The terms "soUd phase support or carrier" are intended to encon:5)ass any support capable of 

binding an antigen or an antibody. Well-known supports or carriers include glass, polystyrene. 
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polypropylene, polyethylene, dextran, nylon, amylases, natural and modified celluloses, 
polyacrylamides, gabhros, and magnetite. The nature of the carrier can be either soluble to some 
extent or insoluble for the purposes of the present invention. The support material may have virtually 
any possible structural configuration so long as the coupled molecule is capable of binding to an 
5 antigen or antibody. Thus, the support configuration may be spherical, as in a bead, or cylindrical, as 
in the inside surface of a test tube, or the extemal surface of a rod. Alternatively, the surface may be 
flat such as a sheet, test strip, etc. Preferred supports include polystyrene beads. Those skilled in the 
art wiU know many other suitable carriers for binding antibody or antigen, or will be able to ascertain 
the same by use of routine experimentation. 
10 The bmding activity of a given lot of anti-wild type or -mutant fingerprint gene peptide 

antibody may be determined accordmg to well known methods. Those skilled in the art will be able 
to determine operative and optimal assay conditions for each determination by employing routine 
experimentation. 

One of the ways in which the gene peptide-specific antibody can be detectably labeled is by 

15 linking the same to an enzyme and usmg it m an enzyme immunoassay (EIA) (VoUer, Ric Clin Lab, 
8:289-98 (1978) ["The Enzyme linked Immunosorbent Assay (ELISA)", Diagnostic Horizons 2: 1-7, 
1978, Microbiological Associates Quarterly Publication, Walkers ville, Md.]; Voller, etaL, /. Clin, 
Pathol, 31:507-20 (1978); Butier, Metk Enzymol, 73:482-523 (1981); Maggio (ed.). Enzyme 
Immunoassay, CRC Press, Boca Raton, Ha. (1980); Ishikawa, et al, (eds.) Enzyme Immunoassay, 

20 Igaku-Shom, Tolq^o (198 1)). The enzyme which is bound to the antibody will react with an 

appropriate substrate, preferably a chromogenic substrate, m such a manner as to produce a chemical 
moiety which can be detected, for example, by spectrophotometric, fluorimetric or by visual means. 
Enzymes which can be used to detectably label the antibody include, but are not limited to, malate 
dehydrogenase, staphylococcal nuclease, delta-5-steroid isomerase, yeast alcohol dehydrogenase^ 

25 alpha-glycerophosphate, dehydrogenase, triose phosphate isomerase, horseradish peroxid^^e, alkaline 
phosphatase, asparaginase, glucose oxidase, beta-galactosidase, ribonuclease, urease, catalase, 
glucose-6-phosphate dehydrogenase, glucoamylase and acetylcholinesterase. The detection can be 
accomplished by colorimetric methods which employ a chromogenic substrate for the enzyme. 
Detection may also be accomplished by visual con^arison of the extent of enzymatic reaction of a 

30 substrate in conoparison wifli similarly prepared standards. 

Detection may also be accon5)lished using any of a variety of other immunoassays. For 
exaicple, by radioactively labeling the antibodies or antibody fragments, it is possible to detect 
fingerprint gene wild type, mutant, or e7q)anded peptides through the use of a radioimmunoassay 
(RIA) (see, e.g., Weintraub, B., Principles of Radioimmunoassays, Seventh Training Course on 

35 Radioligand Assay Techniques, The Endooine Society, March, 1986). The radioactive isotope can be 
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detected by such means as tbe use of a gamma counter or a scintiflatLon counter or by autoradi- 
ography. 

It is also possible to label the antibody with a fluorescent compound. When the fluorescenfly 
labeled antibody is exposed to light of the proper wave length, its presence can then be detected due to 
5 fluorescence. Among the most commonly used fluorescent labeling compounds are fluorescem 
isothiocyanate, rhodamine, phycoerythrin, phycocyanin, allophycocyanin, o-phthaldehyde and 
fluorescamme. 

The antibody can also be detectably labeled using fluorescence emitting metals such as 
152Eu, or others of the lanfhanide series. These metals can be attached to the antibody using such 
10 metal chelating groups as diethylenetriammepentacetic acid (DTPA) or ethylenediamine-tetniacetic 
acid (EDTA). 

The antibody also can be detectably labeled by coupling it to a chemiluminescent compound. 
The presence of the chemiluminescent-tagged antibody is then determmed by detecting the presence 
of luminescence that arises during the course of a chemical reaction. Examples of particularly useful 
15 chemilummescent labeling coEopounds are luminol, isolummol, theromatic acridinium ester, 
imidazole, acridinium salt and oxalate ester. 

Likewise, a biolummescent compound may be used to label the antibody of the present 
invention. Bioluminescence is a type of chemilummescence found in biological systems in, which a 
catalytic protein increases the efficiency of the chemiluminescent reaction. The presence of a 
20 biolummescent protein is detOToined by detectmg the presence of lununescence. Iii5»ortant 
bioluminescent confounds for purposes of labeling are luciferin, luciferase and aequorin. 

Throughout this application, various publications, patents and published patent applications 
are referred to by an identifying citation. The disclosures of these publications, patents and published 
patent specifications referenced m this application are hereby incorporated by reference into the 
25 present disclosure to more fiilly describe the state of the art to which this invention pertains. 

The following examples are intended only to illustrate the present invention and should in no 
way be constmed as limiting the subject invention. 

Examples 

Example 1: Generation and Analyi^ of Mice Compri^g GPCR Gene Disruptions 

30 Targeting Construct fjr Platelet Activating Factor Receptor Gene, To investigate the role 

of genes encoding GPCRs, particularly, platelet activating factor receptor genes, dismptions in genes 
comprising the sequence set forth in SEQ ID N0:1 were produced by homologous recombination. 
More particularly, as shown in Figure 2A-2B, a specific targetmg construct having the ability to 
dismpt or modify genes, specifically con^rishxg SEQ ID NO: 1 was created using as the targetmg 

35 arms (homologous sequences) in the construct, the sequences identified herein as SEQ ID NO: 3 and 
SEQIDN0:4. 
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Transgenic Mice. The targeting construct was introduced into ES cells by electroporation and 
chimeric mice were generated. The ES cells were derived from 129/Sv-+P+Mgf-SLJ/J mouse 
substram. Fl mice were generated by breeding with C57BL/6 females. The resultant FINO hetero- 
zygotes were intercrossed to produce E2N0 homozygotes, or were backcrossed to C57BL/6 mice to 
generate FlNl heterozygotes. F2N1 homozygous mutant mice were produced by intercrossing FlNl 
heterozygous males and females. 

Phenotypic Analysis. The transgenic mice were analyzed for phenotypic changes. The 
homozygous mice demonstrated at least one of the following behavior phenotypes: 

Homozygous mice spent significantly more tune m the central region on the open field test 
than the wild-type mice as shown in Figure 4 and the data presented in Table I below. This indicates 
that the homozygous mice may have less anxiety in comparison to the wild-type mice. 



TABLE 1 - OPEN FIELD TEST: TIME SPENT IN CENTRAL REGION 
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Generation 
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Generation 
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SubFamily 


Gene 
Name 


Time 
(sec) 


Std. 
Err. 


Count 


+/+ 


2 
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GPCR 
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GPCR 


platelet 
activating 
receptor 
homolog 


15.69 


4.81 


3 


+/+ 


2 
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GPCR 


Orphan 
GPCR 


platelet 
activating 
receptor 
homolog 


34.36 


6.20 


9 


-/- 


2 1 
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GPCR 


Oiphan 
GPCR 


platelet 
activating 
receptor 
homolog 


18.35: 


6.06 


6 


-1- 


2 


1 


GPCR 


Oiphan : 
GPCR 


platelet 
activating 
lec^tor 
homolog 


49.07 


4.13 


11 



Homozygous mice had significantly longer latencies to fan or lick their hind paw on the hot 
plate test as shown m Figure 3 and the data presented in Table 2 below. More particularly, the 
homozygous mice displayed an mcreased response latency to lick or fan thek hind paw on the hot 
plate test, indicating that the homozygous mice may have a higher pain threshold in con5)arison to the 
wild-type mice. 
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Generation 
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Gene 
Name 


Time 
(sec) \ 


Std. 
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activating 
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19.50 


4.74 


8 
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GPCR 
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GPCR 
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activating 
receptor 
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23.41 


4.50 


11 



10 



15 



20 



Example 2: Generation and Analysis of Mice Comprising GPCR Gene Disruptions 

Targeting Construct for PAF Receptor Gene. To investigate the role of genes encoding 
GPCRs, particularly, PAF receptor genes, disruptions in genes conoprising the sequence set forth in 
SEQ ID N0:4 were produced by homologous recombmation. More particularly, as shown in Figure 
6A-6B, a specific targeting construct having the ability to disrupt or modify genes, specifically 
comprising SEQ ID N0:4 was created using as the targeting arms (homologous sequences) in the 
construct, the sequences identified herem as SEQ ID N0:5 and SEQ ID N0:6. 
Example 3: Generation and Analysis of Mice Comprising LPR5 Gene Disruptions 

Targeting Construct To investigate the role of LPR5, disruptions in genes comprising the 
sequence set forth ui SEQ ID NO:7 were produced by homologous recombination. More particularly, 
as shown in Rgure 9A-9C, a specific targeting construct having the ability to disrupt or modify genes, 
specifically comprising SEQ ID N0:7 was created using as tiie targeting arms (homologoixs 
sequences) in the construct, tiie sequences identified herein as SEQ ID N0:9 and SEQ ID NO: 10. 

Transgenic Mice. The targeting construct was introduced into ES cells derived from the 
129/OlaHsd mouse substrain by electroporation to generate chimeric mice. Fl mice were generated 
by breeding with C57BL/6 females. The resultant FINO heterozygotes were intercrossed to produce 
P2N0 homozygotes, or were bacfccrossed to C57BU6 mice to generate FlNl heterozygotes. E2N1 
homozygous mutant mice were produced by intercrossing FlNl heterozygous males and females. 

Phenotypic Analysis. The transgenic mice were analyzed for phenotypic changes. The 
homozygous mice demonstrated eye abnormalities, including retinal regeneration. 
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Specifically, histopathology studies demonstrated that the eyes of the homozygous mice 
suffered from retinal degeneration, including bilateral retinal regen^ation. In each homozygous 
mutant, at least one of the following retinal changes were present: retinal folds; thinning and 
vacuolation of the pigment epithelium layer; degeneration of photoreceptors; thinning, disorganiza- 
tion, and pyknosis of the outer nuclear layer, thinning and disorganization of the outer plexiform 
layer, including juxtaposition of the photoreceptor nuclei and the bipolar cell or inner nuclear layer; 
disorganization of the inner nuclear layer; thmning of the mner plexiform layer; loss of ganglion cell 
nuclei, especially large ganglion cells; and, gliosis of the nerve fiber layer. The changes were general- 
ly more prominent in the outer layers of the retina (photoreceptor layers) and least pronounced m the 
inner layers (inner nuclear layer, mner plexiform layer, ganglion cell layer, and nerve fiber layer). 

Behavior Analysis: The homozygous mice demonstrated at least one of the following 
behavior phenotypes: 

The homozygous mice spent significantly less time in the central region in the open field test, 
indicating possible increased anxiety as conq)ared to wild-type mice. This is shown m Figure 10, 

The homozygous mice displayed a decrease in total distance traveled in the open field test 
The homozygous mutants were significantly hypoactive, in that they moved about and ejjplored the 
open field less than wild-type mice. This is shown in Figure U. 

Example 4: Generation and Analysis of IVGce Comprising Cerberus Gene Disruptions 

Targeting Construct To investigate the role of cerberus genes, disruptions in genes 
comprismg the sequence set forth in SEQ ID NO: 1 1 were produced by homologous recombination. 
More particularly, as shown in Figure 13A-13B, a specific targeting construct having the ability to 
disrupt or modify genes, specifically conqirismg SEQ ID NO: 1 1 was created using as the targetmg 
arms (homologous sequences) in the construct, the sequences identified herem as SEQ ID NO: 13 and 
SEQ ID NO: 14. 

Transgenic Mice. The targeting construct was introduced into ES cells derived fix)m the 
129/SvJ X 129/Sv-CP mouse by electroporation to generate chimeric mice, Fl mice were generated 
by breeding with C57BL/6 females. The resultant FINO heterozygotes were mtercrossed to produce 
F2N0 homozygotes, or were backcrossed to 0578176 mice to generate FlNl heterozygotes. F2N1 
homozygous mutant mice were produced by intercarossmg FlNl heterozygous males and females. 

Phenotypic Analysis, The transgenic mice were analyzed for phenotypic changes. The 
homozygous mice at least one of the following behavioral phenotypes: 

The homozygous mice displayed several differences during the open field test. Homozygous 
mice displayed a decrease in their average velocity during episodes of movement, and total distance 
traveled, indicating decreased activity (e.g., hypoactivity). In addition, homozygous mutants 
displayed an increase in the number of fecal boli deposited during the ten-minute test, suggesting 
increased anxiety or nervousness. 
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The homozygous mice displayed a difference during the tail suspension test Homozygous 
mice were more active during the six-ramute test resulting in a decrease m immobility time, 
suggesting that the homozygote mice exhibit a decreased susceptibility to depression (anti-depressive 
behavior phenotype). 

In smmnary, homozygous mutant mice displayed at least one of the following behaviors: 

1) A decrease in average velocity of movement during open field testing con^ared to wild- 
type mice; 

2) A decrease in total distance traveled during open field testmg compared to wild-type 
mice (see Table 3 below and Figure 14); 

3) An increase in the number of fecal boli during open field testing con5)ared to wild-type 
mice; and 

4) A decrease in total time immobile duriag the tail suspension test (see Table 4 below and 
Figure 15). 



TABLE 3 - OPEN FIELD TEST: TOTAL DISTANCE TRAVELED 
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TABLE 4 - TAIL SUSPENSION TEST: TOl^L TIME IMMOBILE 
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£3£ample 5: Generation and Analysis of Mice Comprising Brainiac Gene Disruptions 

Targeting Construct. To investigate the role of brainiac genes, disruptions in genes 
5 comprising the sequence set forth in SEQ ID NO: 17 were produced by homologous recombmation. 
More particularly, a specific targeting construct having the ability to disn^t or modify genes, 
specifically comprismg SEQ ID NO: 17 was created usmg as the targeting arms (homologous 
sequences) m the construct, the sequences identified herein as SEQ ID NO: 15 and SEQ ID NO: 16. 
Transgenic Mice, The targeting construct was introduced mto ES cells derived fi:om 
10 129/OlaHsd mouse substrain by electroporation to generate chimeric mice. Fl mice were generated 
by breeding with C57BL/6 females. The resultant FINO heterozygotes were intercrossed to produce 
F2N0 homozygotes, or were backcrossed to C57BLy6 mice to generate FlNl heterozygotes. F2N1 
homozygous mutant mice were produced by intercrossmg FlNl heterozygous males and females. 

As is apparent to one of skill m die art, various modifications of the above embodiments can 
15 be made without departing from the spirit and scope of this invention. These modifications and 
variations are within the scope of this invention. 
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We claim: 

1. A targeting construct comprising: 

(a) a first polynucleotide sequence homologous to a target gene; 

(b) a second polynucleotide sequence homologous to the target gene; and 

(c) a selectable marker. 

2. The targeting construct of claim 1, wherein the targeting construct further comprises a screening 
marker. 

3. A method of producmg a targeting construct for a target gene, the method comprising: 

(a) obtaining a first polynucleotide sequence homologous to a target gene; 

(b) obtaining a second polynucleotide sequence homologous to the target gene; 

(c) providing a vector comprising a selectable marker; and 

(d) inserting the first and second sequences into the vector, to produce the targetmg construct. 

4. A method of producing a targeting construct for a target gene, die method comprising: 

(a) providing a polynucleotide comprising a first sequence homologous to a first region of a 
target gene and a second sequence homologous to a second region of a target gene; and 

(b) inserting a positive selection marker in between the first and second sequences to form the 
targeting construct. 

5. A cell comprising a disruption in a target gene. 

6. The cell of claim 5, wherein the cell is a murine cell. 

7. The cell of claim 6, wherein the murine cell is an embryonic stem cell. 

8. A non-human transgenic animal con^rising a disruption in a target gene. 

9. A cell derived firom the non-human transgenic animal of claim 8. 

10. A method of producing a transgenic mouse comprising a disruption in a target gene, the method 
comprising: 

(a) introducing the targeting construct of claim 1 into a cell; 

(b) introducing the cell into a blastocyst; 

(c) implanting die resultmg blastocyst into a pseudopregnant mouse, wherein said pseudopregnant 
mouse gives birth to a chimeric mouse; and 

(d) breeding the chimeric mouse to produce the transgenic mouse. 

1 1. A method of identifying an agent fliat modulates the expression of a target gene, die method 
conq)rising: 

(a) providing a non-iiuman transgenic animal comprismg a disruption in flie target gene; 
(b) administering an agent to ttie non-human transgenic animal; and 
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(c) detennining whether the expression of the disrupted target gene in the non-human transgenic 
animal is modulated. 

12. A method of identifying an agent that modulates the function of a target gene, the method 
comprising: 

(a) providmg a non-human transgenic animal comprising a disruption in the target gene; 

(b) administering an agent to the non-human transgenic animal; and 

(c) determining whether the function of the disrupted target gene in the non-human transgenic 
animal is modulated. 

13. A method of identifying an agent that modulates ttie expression of a target gene, the method 
comprising: 

(a) proYidmg a cell comprismg a disruption in a target gene; 

(b) contactmg the cell with an agent; and 

(c) determining whether expression of the target gene is modulated. 

14. A method of identifying an agent that modulates the function of a target gene, the method 
con5)rising: 

(a) providing a cell comprising a disruption m a target gene; 

(b) contacting the cell with an agent; and 
(c) determining whether the function of the target gene is modulated. 

15. The method of claim 13 or claim 14, wherein the cell is derived from the non-human transgenic 
animal of claims. 

16. An agent identified by the mefliod of claim 11, claim 12, claim 13, or claim 14. 

17. A transgenic mouse con^rising a disruption in a platelet activating receptor gene, wherein the 
transgenic mouse exhibits less anxious behavior as compared to wild-type mice. 

18. A transgenic mouse conqirismg a dismption in a platelet activatmg receptor gene, wherein the 
transgenic mouse exhibits a higher pain threshold as compared to wild-type mice. 

19. The transgenic mouse of claim 17, wherem the transgenic mouse is heterozygous for a disruption in a 
platelet activating receptor gene. 

20. Ihe transgenic mouse of claim 17, wherem the transgenic mouse is homozygous for a disruption in a 
platelet activating receptor gene. 

21. A method of produchig a transgenic mouse comprising a disruption in a platelet activating receptor 
gme, wherdn the transgenic mouse exhibits at least one of the following behaviors: less anxiety as 
con^ared to wild-type mice or higher pain threshold as compared to wild-type mouse, the method 
comprising: 
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la) introducing a disruption in a platelet activating receptor gene targeting construct into a cell; 

(b) introducing ftie cell into a blastocyst; 

(c) inq)lanting the resulting blastocyst into a pseudopregnant mouse, wherdn said pseudopregnant 
mouse gives birHi to a chimeric mouse; and 

(d) breeding the chimeric mouse to produce the transgenic mouse comprising a disruption in a 
disruption in a platelet activating receptor gene. 

22. A cell derived fiiom the transgenic mouse of claim 17, claim 18 or claim 21, wherein the cell 
comprises a disruption in a platelet activatmg receptor gene. 

23. A method of identifying an agent which modulates platelet activating receptor expression, the method 
comprising: 

(a) administering an agent to the transgenic mouse comprising a disnqition in a platelet 
activating receptor gene; and 

(b) determining whether the agent modulates platelet activating receptor expression in the 
transgenic mouse, wherein the agent modulates a phenotype associated with a dismption in a 
platelet activating receptor gene. 

24. A method of identifying an agent which modulates a phenotype associated with a disruption in a 
platelet activating receptor gene, the method con[5)rising: 

(a) administering an agent to a transgenic mouse comprising a disruption in a platelet activattag 
receptor gene; and 

(b) determining whether the agent modulates the phenotype. 

25. A method of identifying an agent which modulates platelet activating receptor expression, the mefiiod 
comprising: 

(a) providing a cell comprising a disruption in a platelet activating receptor gene; 

(b) contacting the cell with an agent; and 

(c) determining whether the agent modulates platelet activating receptor gene expression, 
wherein the agent modulates a phenotype associated with a disruption in a platelet activating 
receptor gene. 

26. A inethod of identifying an agent which modulates platelet activatuig receptor gene function, the 
method comprising: 

(a) providing a cell comprising a disruption in a platelet activating receptor gene; 

(b) contacting the cell with an agent; and 
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(c) determining whether the agent modulates platelet activating receptor gene function, wherein 
the agent modulates a phenotype associated with a disruption in a platelet activating receptor 
gene. 

27. An agent identified by the method of claim 23, claim 24, claim 25, or claim 26. 

28. A transgenic mouse comprising a disruption in a LPR5 gene, wherein the transgenic mouse exhibits 
an eye abnormality. 

29. The transgenic mouse of claim 28, wherein the eye abnormality is a retinal abnormality. 

30. The transgenic mouse of claim 29, wherein the retinal abnormaUty is characterized by retinal 
degeneration. 

31. The transgenic mouse of claim 30 wherein the transgenic mouse is heterozygous for a disn:?)tion in 
anLPRSgene. 

32. The transgenic mouse of claim 31, wherein the transgenic mouse is homozygous for a disruption in 
an LPR5 gene. 

33. A method of producing a transgenic mouse comprismg a disruption in an LPR5 gene, wherein the 
transgenic mouse exhibits an eye abnormality, the method comprismg: 

(a) introducing an LPR5 targetmg construct into a cell; 

(b) introducing the cell into a blastocyst; 

(c) implahtmg the resultrag blastocyst into a pseudopregnant mouse, wherein said pseudopregnant 
mouse gives birth to a chimeric mouse; and 

(d) breeding the chimeric mouse to produce the transgenic mouse comprising a disruption in an 
LPR5 gene. 

34. A ceil derived from the transgenic mouse of claim 28 or claim 33. 

35. A naethod of identifying an agent that ameHorabcs an eye abnormality, the method comprisiag: 

(a) administering an agent to the transgenic mouse comprising a dismption in a LPR5 gene; and 

(b) determining whether the agait ameliorates the eye abnormaUty in the transgenic mouse. 

36. The method of claim 35, wherein the eye abnormaUty is a retinal abnormaUty. 

37. A method of identifying an agent which modulates LPR5 gene expression, die method comprising: 

(a) administering an agent to the transgenic mouse comprisiag a disruption in a LPR5 gene; and 

(b) determining whether the agent modulates LPR5 expression in the transgenic mouse, wherein 
the agent modulates a phenotype associated with a disruption in a LPR5 gene. 

38. The noethod of claim 37, wherem the phenotype comprises an eye abnormaUty. 

39. The method of claim 38, wherein the eye abnormaUty is a retinal abnormaUty. 

40. A method of identifying an agent which modulates LPR5 gene expression, the method comprising: 
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(a) providing a cell coirq)rising a disruption in a LPR5 gene; 

(b) contacting the cell with the agent; and 

(c) determining whether the agent modulates LPR5 gene expression, wherein the agent 
modulates a phenotype associated with a disruption in a LPR5 gene. 

41 . Hie method of claim 40, wherein the phenotype is an eye abnormality. 

42. The method of claim 41, wherein the eye abnormality is a retinal abnormality. 

43. A method of identifying an agent which modulates LPR5 gene function, the method comprising: 

(a) providing a cell comprising disruption in a LPR5 gene; 

(b) contacting the cell with an agent; and 

(c) determining whether the agent modulates LPR5 gene function, wherein the agent modulates a 
phenotype associated with a disruption in a LPR5 gene. 

44. The method of claim 43, wherein the phenotype is an eye abnormality, 

45. The method of claim 44, wherein the eye abnormality is a retinal abnormality. 

46. A method of identifying an agent which modulates a phenotype associated with a disruption in a 
LPR5 gene, the method con^sing: 

(a) administering an agent to a transgenic mouse comprising a disruption in a LPR5 gene; and 

(b) detenniniag whether ftie agent modulates the phenotype. 

47. The method of claim 46, wherein the phenotype is an eye abnormality. 

48. The method of claim 44, wherein the eye abnormality is a retinal abnormality. 

49. An agent identified by the method of claim 37, claim 40, claim 43, or claim 46. 

50. A transgenic mouse comprising a disruption in a cerberus gene, wherein the transgenic mouse 
exhibits at least one of the following behaviors: hypoactivity, anxiety, nervousness or a decreased 
susceptibility to depression. 

51. The transgenic moxise of claim 50, wherein the transgenic mouse is heterozygous for a disruption in 
cerberus gene. 

52. The transgenic mouse of claim 50, wherein the transgenic mouse is homozygous for a disruption in a 
cerbems gene. 

53. A method of producing a transgenic mouse con^rising a disruption in a cerbems gene, whercin the 
transgenic mouse exhibits at least one of the following behaviors: hypoactivity, anxiety, nervousness or a 
decreased susceptibUity to depression, the method comprising: 

(a) introducing a cerberus gene targeting construct into a cell; 

(b) introducing the cell mto a blastocyst; 
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(c) implantiag the resulting blastocyst into a pseudopregnant mouse. \^4ierein said pseudoprcgnant 
mouse gives birth to a chimeric mouse; and 

(d) breeding the chimeric mouse to produce the transgenic mouse comprising a dismption in a 
Cerberus gene. 

54. A cell derived from the transgenic mouse of claim 50 or claim 53. 

55. A method of identifying an agent that ameliorates hypoactive, anxious or nervous behavior, the 
method comprising: 

(a) administering an agent to a transgenic mouse comprising a dismption m a cerberus gene; and 

(b) determining whether the agent ameliorates behavior of the transgenic mouse. 

56. A method of identifying an agent which modulates cerberus gene expression, the method comprising: 

(a) administering an agent to the transgenic mouse comprising a disruption in a cerberus gene; 
and 

(b) determining whether the agent modulates cerberus expression in the transgenic mouse, 
wherein the agent has an effect on hypoactive, anxious, or nervous behavior of the transgenic 
mouse. 

57. A method of identifying an agent which inodulates hypoactive, anxious or nervous behavior 
associated with a disruption in a cerberus gene, the method con5)rising: 

(a) administering an agent to a transgenic mouse comprising a disruption in a cerberus gene; and 

(b) determining whether the agent modulates behavior of the transgenic mouse. 

58. An agent identified by the method of claim 55, claim 56 or claim 57. 
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ATTCGGCACTGAGGAGTTAATACACAGCTGCAAGATATACCGAATACAAGAAC 
CTGGGTTTGCCAAAATGATATCAGCTGTCGTGTGGCTCATGGTCCT6CT6ATC 
ATGGTGCCQAACATGGTGATTCCCATCAAGGACATCAAAGAAAAGTCAAATGT 
AGGTT6CATGGAGTTTAAAAAGGAGTTTGGAAGAAACTGGCATTTGCTAACAA 
ACTTCATCTGTGTGGCAATATTTTTAAATTTCTCAGTCATCATTTTGATATCC 
AACTTCCTTGCAATCAGACAACTCTATAGAAATAGAGACAACACAAACTATCC 
AAGTGTGAAATCAGCTCTGCTCCACATTCTCTTGGTGACAGCTAGTTACATCA 
TATGCTTTGTTCCCTACCACGCTGTCAGGATCCCATACACCCTCAGCCAGACA 
GAGGTCATATCTGATTGCTCAACTAGGATCGCACTCTTCAAAGCCAAAGAGGC 
CACACTGCTGCTGGCGGTGTCTAATTTGTGTTTTGATCCAATCCTGTACTATC 
ATCTGTCAAAAGCCTTTCGACTAAAGGTCACCGAGACTTTCGCCTCACCTAAA 
AAGAGCAAGCCTCTGGAAGAAAGATTGAGAAGTGAGAATGATGTGTAAAATGC 
AGGCTTTGCACGTTAGTCCTTCTTCCTGACTGGACCATAAAGCTGGTCACAGT 
TTTC (SEQ ID N0:1) 
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Construct 

Gene: 17 Gl Nuiiiber(s); 1738456 

Gene Family: GPCR 1749042 

Suwlmily: Orphan GPCR 
Gene Sequence: EST contig. Mouse 



underlined = deleted in targeting construct 

t ] = sequence flanking Neo insert in targeting construct 



A!rTCGGCACTG2lGGaG[TTAATACACAGCTGCAAGATATACCGAATACi^ 

TTGCCAAAATGATATCAGCTGTCGTGTGGCTCATGGTCCTGCTGATCATGGTGCCGAACA 

TGQTGA] TTCCCATCAAGGACATCAAAGAAAAGTCAAATGTAGQTTGCATG^ 

AAGGAGT TTGGAAGAAACTGGGACTTGCTAACAAACTTCAO^CTOTGTGGCAATATO^ 

AATCTCTCAGTCATCATO a:TOATATC(^CTTCCTTGCAATCAGA(^ 

AGAGACAACACAAACTAT CCAAGTGTGAAATCAGCTCTGCTCCACAT TCTCTTGQTaArA 

GCTAGTTACATCATATGC raTOTTCCCTACCACGCTGTCAGGATCCCATACACCC^^ 

CAGACAGAGGTCATATCTGATTOCTCAACTAGGATCGCACTCOTCAAAGCPAAAGAGGCC 

ACACTGCaXK^TGGCGGT GTCTAATTTGTGOTTTOATCCAATCOTGTACTATCATC 

AAAGCCTTTCGACTAA AGGTCACCGAGACTOTCncCTCACCTAAAAAQAGCAAGCCTCTG 

GAAGAAAGATa?GAGAAGTGAGAATQATGTGTAAAATGCAGGCTTTGCAgGTTAGTCCTTC 

TTCCTG lACTGGACCATAAAGCTGGTCACAGOTTTCl 



Gene Sequence ^ 

Structure* ^ Sequence Deleted 664 bp 



Size of EST contig: 693 bp 
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Targeting Vector* (genomic sequence) 
Construct Number: 87 



Arm Length: 
5':8kb 
3-: 1.6 kb 



— — — — -i€rido^nftU5yLq*,i^ 
* Not drawn to scale 




5 » >CCACTGCCAAGTGACAGCCTG 

CCrCATCTACATCAATATGTACTT 

ATCTATTATCTTCTTAGCCTTTGT 

CAGCATCGATCGCTGTCTITCAGTT 

AATACACAGCTGCAAGATATACCG 

AATACAAGAACCTGGGTTTGCCAA 

AATGATATCAGCTGTCGTGTGGCT 

CATGGTCCTGCTGATCATGGTGTC 

GAACA3?GGirGA-c3 » 

(SEQ ID N0:2) 



5 » >ACTGGACCATAAAGOTGGaXIA 
CAGTTTTCAAAAATAAAAGAAGAT 
GCTGAGATAGAAGAAGATGCTTA6 
GAATAAGAGCCACCAATAGCTAGC 
AAACGGGACTGGTTCATGCAiGAAA 
GTCCTTTCTTCAATCCAAGCCAAC 
CCTGCACTCTCACAGATGraGACA i 
TTCTTTACCTAACAAAGCACATTO 
AACCAAGGTCAO' 
(SEQ -ID N0:3) 
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c 
or 



30- 



25- 



m 
c 

m 

"Si? 

^ 15- 



20- 



10- 



hot plate - latency to hindpaw licking 




17 



Gen© 



I ■»-/+, F2, NO 



•»-/4-,F2,N1 



-/-, F2, NO 



I -/-,F2,N1 



©Deltagen, Inc. 



'3 
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GGCCTCTGAGAACCTTGGTGTGCCAAGTTACTTCAGTCACATGTTATTTTACA 
ATGTATATCAGTATATCGTTCCTGGGGTTGATAACCATTGACCGCTACCTGAA 
GACCACCAGGCCATTTAAAACGTCCAGCCCCAGCAATCTCTTGGGTGCAAAGA 
TTCTTTCTGTTGTCATCTGGGCCTTCATGTTCTTAATTTCACTGCCTAACATG 
ATTCTCACCAACAGGAGGCCAAAAGATAAGGACGTAACAAAATGTTCTTTCTT 
AAAGTCAGAGTTTGGTCTAGTTTGGCACGAAATAGTCAATTACATCTGCCAAG 
TCATTTTCTGGATTAATTTTTTAATTGTCATCGTTTGTTATAGCCTCATTACC 
AAAGAACTCTATCGGTCTTATGTCAGAACAAGGG6TTCAGCCAAAGTTCCCAA 
GAGAAAGGTAAACGTCAAGGTTTTCATCATCATTGCTGTATTCTTTATTTGCT 
TTGTTCCCTTCCACTTT6GACGGATTCCCTACACCCTGAGGCAGACTCGGGCC 
GTCTTTGACTGC (SEQ ID NO: 4) 
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Construct 



Gene: 43 
Gene Family: GPCR 
Gene 

Subfamily: 
Gene Sequence: EST, Mouse 



Gi Number(s): 1912557 



Orphan GPCR 



underlined = deleted in targeting constmct 

[ ] = sequence flanking Neo insert in targeting construct 



tGGCCTCTGAGAACCTOGGTOTGCCAAGTTACTTC] AGTCACATGTTATTTTACAATGTA 
TATCAGrATATCGTTCCTGGGGTTGAO'AACCATTGACCGCaJACCTOAAGACCACCAGGCC 
AI^TAAAACGTCCAGCCCCTVGCA^TCTCTTGGGTGCAAAGATTCTTTCTGTTGTC^^ 
GGCCTTCaTGTTCTOAATa?TCACTGCCTAACATOATTCTCACCaLACAGGAGGCCAA^ 
TAAGGACGTAACAAAATGTTCTTTCYTjl?^GTCAGAGTTOGGyCTAGraT^ 
AGTCAATTACATCTGCCAAGTCATTTTCTGGATTAATTTTTTAATOQTC^ 
TAGCCTCATTACCAAAGAACTCTATCGGTCTTATGTC I AGAACAAGGGGOTCAGCCAAAG 
TTCCCAAGAGAAAGGTAAATCTCAAGGTTTTCAOJCATCM^TGOTG^^ 
TTGTTCCCTTCCACOTTGCACGGATTCCCTACACCCaXSAGGCAGACTCGGGCCGTCT 
ACTGC] 



Gene Sequence 
Structure * 



Size of EST: 542 bp 



35 bp 



Sequence Deleted 



395 bp 
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Targeting Vector* (genomic sequence) 
Construct Number: 263 



Arm Length: 
5*:1.1kb 
3*: 3.3 kb 



.5*anna 



Neo 



3' Sim 




. ^fei:g^^hg Sector 

' Not drawn to scale 



B ' K:aK3TTCTTTGCTGGGCTCATC 
ACG3ACAGCTTGGCAATGAGGR.TT 
TTCTTTCAGATCCGCAGTAAATCC 
AACTTCATCATTrTTCTTAAGAAC 
ACGGa?CATCTC!rQATCaiACTAA!rG 
ATTCTAACTO^TCCArXTAAAATT 
CTTAGTGATGCTATACTGGGAGCC 
GGeCCTCTGAGAACCOTGGTGTGC 
CAAGTTACTTCO » 
(SEQ ID HO: 5) 



5 ' >AGAACAAGGGGOTCAGCCAAA t 

CTTCCCAAGAAAAAGGTAAACGTC 

AAGGTTTTCATCATCAOTGCTGTA 

TTCTTTATTTGCTTTGTTCCCTTC 

CACTTTGCACGGATTCCCTACACC 

CTGAGCC2VAACTCGGGCCGTCTIPir 

GACTGCAGTGCTGAGAACACCC!PG 

TTCTACGTGAAGGAGAGCACCCTA 

TGGCTGACGTCO ' 

(SEQ ID WO: 6) 



/yd 
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GCCGCGGCGCCCGAGGCGGGAGCAAGAGGCGCCGGGAGCCGCGAGGATCCACC 
GCCGCCGCGCGCGCCATGGAGCCCGAGTGAGCGCGCGGCGCTCCCGGCCGCCG 
GACGACATGGAAACGGCGCCGACCCGGGCCCCTCCGCCGCCGCCGCCGCCGCT 
GCTGCTGCTGGTGCTGTACTGCAGCTTGGTCCCCGCCGCGGCCTCACCGCTCC 
TGTTGTTTGCCAACCGCCGGGATGTGCGGCTAGTGGATGCCGGCGGAGTGAAG 
CTGGAGTCCACCATTGTGGCCAGTGGCCTGGAGGATGCAGCTGCTGTAGACTT 
CCAGTTCTCCAAGGGTGCTGTGTACTGGACAGATGTGAGCGAGGAGGCCATCA 
AACAGACCTACCTGAACCAGACTGGAGCTGCTGCACAGAACATTGTCATCTCG 
GGCCTCGTGTCACCTGATGGCCTGGCCTGTGACTGGGTTGGCAAGAAGCTGTA 
CTGGACGGACTCCGAGACCAACCGCATTGAGGTTGCCAACCTCAATGGGACGT 
CCCGTAAGGTTCTCTTCTGGCAGGACCTGGACCAGCCAAGGGCCATTGCCCTG 
GATCCTGCACATGGGTACATGTACTGGACTGACTGGGGGGAAGCACCCCGGAT 
CGAGCGGGCAGGGATGGATGGCAGTACCCGGAAGATCATTGTAGACTCCGACA 
TTTACTGGCCCAATGGGCTGACCATCGACCTGGAGGAACAGAAGCTGTACTGG 
GCCGATGCCAAGCTCAGCTTCATCCACCGTGCCAACCTGGACGGCTCCTTCCG 
GCAGAAGGTGGTGGAGGGCAGCCTCACTCACCCTTTTGCCCTGACACTCTCTG 
GGGACACACTCTACTGGACAGACTGGCAGACCCGCTCCATCCACGCCTGCAAC 
AAGTGGACAGGGGA6CAGAGGAAGGAGATCCTTAGTGCTCTGTACTCACCCAT 
GGACATCCAAGTGCTGAGCCAGGAGCGGCAGCCTCCCTTCCACACACCATGCG 
AGGAGGACAACGGTGGCTGTTCCCACCTGTGCCTGCTGTCCCCGAGGGAGCCT 
TTCTACTCCTGTGCCTGCCCCACTGGTGTGCAGTTGCAGGACAATGGCAAGAC 
GTGCAAGACAGGGGCTGAGGAAGTGCTGCTGCTGGCTCGGAGGACAGACCTGA 
GGAGGATCTCTCTGGACACCCCTGACTTCACAGACATAGTGCTGCAGGTGGGC 
GACATCCGGCATGCCATTGCCATTGACTACGATCCCCTGGAGGGCTACGTGTA 
CTGGACCGATGATGAGGTGC6GGCTATCCGCAGGGCGTACCTAGATGGCTCAG 
GTGCGCAGACACTTGTGAACACTGAGATCAATGACCCCGATGGCATTGCTGTG 
GACTG6GTCGCCCGGAACCTCTACTGGACAGATACAGGCACTGACAGAATTGA 
GGTGACTCGCCTCAACGGCACCTCCCGAAAGATCCTGGTATCTGAGGACGTGG 
ACGAACCGCGAGCCATTGTGTTGCACCCTGTGATGGGCCTCATGTACTGGACA 
GACTGGGGGGAGAACCCCAAAATCGAATGCGCCAACCTAGATGGGAGAGATCG 
GCATGTCCTGGTGAACACCTCCCTTGGGTGGCCCAATGGACTGGCCCTGGACC 
TGCAGGAGGGCAAGCTGTACTGGGGGGAT6CCAAAACTGATAAAATCGAGGTG 
ATCAACATAGACGGGACAAAGCGGAAGACCCTGCTTGAGGACAAGCTCCCACA 
CATTTTTGGGTTCACACTGCTGGGGGACTTCATCTACTGGACCGACTGGCAGA 
GACGCAGTATTGAAAGGGTCCACAAGGTCAAGGCCAGCCGGGATGTCATCATT 
GATCAACTCCCCGACCTGATGGGACTCAAAGCCGTGAATGTGGCCAAGGTTGT 
CGGAACCAACCCATGTGCGGATGGAAATGGAGGGTGCAGCCATCTGTGCTTCT 
TCACCCCACGTGCCACCAAGTGTGGCTGCCCCATTGGCCTGGAGCTGTTGA6T 
GACATGAAGACCTGCATAATCCCCGAGGCCTTCCTGGTATTCACCAGCAGAGC 
CACCATCCACAGGATCTCCCTGGAGACTAACAACAACGATGTGGCTATCCCAC 
TCACGGGTGTGAAAGAGGCCTCTGCACTGGACTTTGATGTGTCCAACAATCAC 
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ATCTACTGGACTGATGTTAGCCTCAAGACGATCAGCCGAGCCTTCATGAATGG 

GAGCTCAGTGGAGCACGTGATTGAGTTTGGCCTCGACTACCCTGJ^GGAATGG 

CTGTGGACTGGATGGGCAAGAACCTCTATTGGGCGGACACAGGGACCAACAGG 

ATTGAGGTGGCCCGGCTGGATGGGCAGTTCCGGCAGGTGCTTGTGTGGAGAGA 

CCTTGACAACCCCAGGTCTCTGGCTCTG6ATCCTACTAAAGGCTACATCTACT 

GGACTGAGTGGGGTGGCAAGCCAAGGATTGTGCGGGCCTTCATGGATGGGACC 

AATTGTAT6ACACTGGTAQACAAGGTGGGCCGGGCCAACGACCTCACCATTGA . 

TTATGCCGACCAGCGACTGTACTGGACTGACCTGGACACCAACATGATTGAGT 

CTTCCAACATGCTGGGTCAGGAGCGCATGGTGATAGCTGACGATCTGCCCTAC 

CCGTTTGGCCTGACTCAATATAGCGATTACATCTACTGGACTGACTGGAACCT-.... 

GCATAGCATTGAACGGGCGGACAAGACCAGTGGGCGGAACCGCACCCTCATCC 

AGGGTCACCTGGACTTCGTCATGGACATCCTGGTGTTCCACTCCTCCCGTCAG 

GATGGCCTCAACGACTGCGTGCACAGCAATGGCCAGTGTGG6CAGCTGTGCCT 

CGCCATCCCCGGAGGCCACCGCTGTGGCTGTGCTTCACACTACACGCTGGACC 

CCAGCAGCC6CAACTGCAGCCC6CCCTCCACCTTCTTGCT6TTCAGCCAGAAA 

TTTGCCATCAGCCGGATGATCCCCGATGACCAGCTCAGCCCGGACCTTGTCCT 

ACCCCTTCATGGGCTGAGGAACGTCAAAGCCATCAACTATGACCCGCTGGACA 

AGTTCATCTACTGGGTGGACGGGCGCCAGAACATCAAGAGGGCCAAGGACGAC 

GGTACCCAGCCCTCCATGCTGACCTCTCCCAGCCAAAGCCTGAGCCCAGACAG 

ACAGCCACACGACCTCAGCATTGACATCTACAGCCGGACACTGTTCTG6ACCT 

GTGAGGCCACCAACACTATCAATGTCCACCGGCTGGATGGGGATGCCATGGGA 

GTGGTGCTTCGAGGGGACCGTGACAAGCCAA66GCCATTGCTGTCAATGCTGA 

GCGAGGGTACATGTACTTTACCAACATGCAGGACCATGCTGCCAAGATCGAGC 

GAGCCTCCCTGGATGGCACAGAGCGGGAGGTCCTCTTCACCACAGGCCTCATC 

CGTCCCGTGGCCCTTGTGGTGGACAATGCTCTGGGCAAGCTCTTCTGGGTGGA 

TGCCGACCTAAAGCGAATCGAAAGCTGTGACCTCTCTGGGGCCAACCGCCTGA 

CCCTGGAAGATGCCAACATCGTACAGCCAGTAGGTCTGACAGTGCTGGGCAGG 

CACCTCTACTGGATCGACCGCCAGCAGCAGATGATCGAGCGCGTGGAGAAGAC 

CACTGGGGACAAGCGQACTAGGGTTCAGGGCCGTGTCACCCACCTGACAGGCA 

TCCATGCCGTGGAGGAAGTCAGCCTGGAGGAGTTCTCAGCCCATCCTTGTGCC 

CGAGACAATG6CGGCTGCTCCCACATCTGTATCGCCAAGGGTGATGGAACACC 

GCGCTGCTCGTGCCCTGTCCACCTGGTGCTCCTGCAGAACCTGCTGACTTGTG 

GTGAGCCTCCTACCTGCTCCCCTGATCAGTTTGCATGTACCACTGGTGAGATC 

GACTGCATCCCCGGAGCCTGGCGCTGTGACGGCTTCCCTGAGTGT6CTGACCA 

GAGTGATGAAGAAGGCTGCCCAGTGTGCTCCGCCTCTCAGTTCCCCTGCGCTC 

GAGGCCAGTGTGTGGACCTGCGGTTACGCTGCGACGGTGAGGCCGACTGCCAG 

GATCGCTCTGATGAAGCTAACTGCGATGCTGTCTGTCTGCCCAATCAGTTCCG 

GTGCACCAGCGGCCAGTGTGTCCTCATCAAGCAACAGTGTGACTCCTTCCCCG 

ACTGTGCTGATGGGTCTGATGAGCTCATGTGTGAAATCAACAAGCCACCCTCT 

GATGACATCCCAGCCCACAGCA6TGCCATTGGGCCCGTCATTGGTATCATCCT 

CTCCCTCTTCGTCATGGGCG6GGTCTACTTTGTCTGCCAGCGTGTGATGTGCC 
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AGCGCTACACAGGGGCCAGTGGGCCCTTTCCCCACGAGTATGTTGGTGGAGCC 
CCTCATGTGCCTCTCAACTTCATAGCCCCAGGTGGCTCACAGCACGGTCCCTT 
CCCAGGCATCCCGTGCAGCAAGTCCGTGATGAGCTCCATGAGCCTGGTGGGGG 
GGCGCGGCAGCGTGCCCCTCTATGACCGGAATCACGTCACTGGGGCCTCATCC 
AGCAGCTCGTCCAGCACAAAGGCCACACTATATCCGCCGATCCTGAACCCACC 
CCCGTCCCCGGCCACAGACCCCTCTCTCTACAACGTG6AC6TGTTTTATTCTT 
CAGGCATCCCGGCCACCGCTAGACCATACAGGCCCTACGTCATTCGAGGTATG 
GCACCCCCAACAACACCGTGCAGCACAGATGTGTGTGACAGTGACTACAGCAT 
CAGTCGCTGGAAGAGCAGCAAATACTACCTGGACTTGAATTCGGACTCAGACC 
CCTACCCCCCCCCGCCCACCCCCCACAGCCAGTACCTATCTGCAGAGGACAGC 
TGCCCACCCTCACCAGGCACTGAGAGGAGTTACTGCCACCTCTTCCCGCCCCC 
ACCGTCCCCCTGCACGGACTCGTCCTGACCTCGGCCGTCCACCCGGCCCTGCT 
GCCTCCCTGTAAATATTTTTAAATATGAACAAAGGAAAAATATATTTTATGAT 
TTAAAAAATAAATATAATTGGGGTTTTTAACAAGTGA6AAATGTGAGCGGTGA 
AGGGGTGGGCAGGGCTGGGAAACTTTTCTAG (SEQ ID NO: 7) 
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METAPTRAPPPPPPPLLLLVLYCSLVPAAASPLLLFANRRDVELVDAGGVKLE 

STIVASGLEDAAAVDFQFSKGAVYWTWSEEAIKQTYLNQTGAAAQKEVISGL 

VSPDGLACDWGKEO^yWTDSBTmiEVANLNGTSRECVLFWQDLDQPRAIALDP 

AHGYlOTirTDWGEAPRIERAGMDGSTRKIIVDSDIYWPNGLTIDLEEQKLyWAD 

AKLSFIHRAiaLDGSFRQKVVEGSLTHPFALTLSGDTLYWTDWQTRSIHACNK^ 

TGEQRKEILSALYSPMDIQVLSQERQPPFHTPCEEDNGGCSHLCLLSPREPFY 

S CACPTGVQLQDNGKTCKTGAEEVLLLARRTDLRRI SLDTPDFTDIVLQVGDI 

I^IAIDYDPLEGYVYWTDDEVRAIKEIAYLDGSGAQTLVNTEIIOTDPIXJIA 

VARNLYTIWDTGTDRIEVTRLNGTSRKILVSEDLDEPRAIVLHPVMGI^^ 

GENPKIECAKTLDGRDRHVLVNTSLGWPNGLALDLQEGKLYWGDAKTDKIEVIW 

IDGTKRKTLLEDKLPHIFGFTLLGDFIYWTDWQRRSIERVHKVKASRDVIIDQ 

LPDmGLKAVNVAEWGTNPCADGNGGCSHLCFFTPRATKCGCPIGLELLSDM 

KTCIIPEAFLWTSRATIHRISLETNNiroVAIPLTGVKEASALDFDVSI^ 

WTDVSLKTISRAFlMGSSVEHVIEPGLDyPEGMAVDWMGK]^ 

VARLmQPRQVLVWRDLDlIPRSLALDPTKGYIYWTEWGGKPRIVRAF^ 

MTLVDKVGRAMDLTIDYADQRLYWTDIJDTmiESSKIlffl^ 

GLTQYSDYIYWTDWlTLHSIERADKTSGRmTLIQGHLDFVMDILVFHSSRQDG 

LNDCVHSNGQCGQLCLAIPGGHRCGCASHYTLDPSSRNCSPPSTFLLFSQKFA 

ISRMIPDDQLSPDLVLPLHGLRlJVKAIim)PLDKPIYWVI)GRQNIKRAEa)DGT 

QPSMLTSPSQSLSPDRQPHDLSIDIYSRTLFWTCE^Tl^INVHRLDGDlkMGW 

LRGDRDKPRAIAWAERGYMYFTIMQDHAAKIERASLDGTEREVLPTTGLIRP 

VALVVDNALGKLFWVDADLKRIESCDLSGANRLTLEDANIVQPVGLTVLGI^ 

YWIDRQQQMIERVEKTTGDKRTRVQGRVTHLTGIHAVEEVSLEEFSAHPCARD 

NGGCSHICIAKGDGTPRCSCPVHLVLLQNLLTCGEPPTCSPDQFACTTGEIDC. 

IPGAWRCDGFPECADQSDEEGCPVCSASQFPCAR6QCVDLRLRCDGEADCQDR 

SDEANCDAVCLPNQFRCTSGQCVLIKQQCDSFPDCADGSDELMCEIKTKPPSDD 

IPAHSSAIGPVIGIILSLFVMGGVYFVCQRVMCQRYTGASGPFPHEYVGGAPH 

VPLNFIAPGGSQHGPFPGIPCSKSVMSSMSLVGGRGSVPLYDRISIHVTGASSSS 

SSSTKATLYPPILNPPPSPATDPSLYIWDVFYSSGIPATARPYRPYVTRGMAP 

PTTPCSTDVCDSDYSISRWKSSKYYLDLNSDSDPYPPPPTPHSQYLSAEDSCP 

PSPGTERSYCHLFPPPPSPCTDSS (SEQ ID NO: 8) 



wo 01/98508 



13/23 



PCT/USO 1/20015 



Construct 



Gene: 193 Gl Number(s): 6678715 

Gene Family: EGF domain protein 

Gene 

Subf ami ly ' Low-density lipoprotein receptor 
Gene Sequence: fuH-lengfh cDNA, Mouse 



underlined = deleted in targeting construct 

[1^ sequence flanking Neo insert in targeting construct 



GCCGCGGCGCCCGAGGCGGGAGCAAGAGGCGCCGGGAGCCGCGAGOATCCACCGCCGCCG 

CGCGCGCCATGGAGCCCGAGTGAGCGCGCGGCGCTCCCGGCCGCCGGACGACATGGAAAC 

GGCGCCGACX:CGGGCCCCTCCGCCGCCGCCGCCGCCGCTGCTGCa?GCTGGTGCTGTACTG 

CAGCTTGGTCCCCGCCGCGGCCTCACCGCTCCTGTTGTTTGCCAACCGCCGGGATGTGCG 

GCTAGTGGATGCCGGCGGAGTGAAGCTGGAGTCCACCATTGTGGCCAGTGGCCTGGAGGA 

TGCAGCTCCTGTAGACTTCCAGTTOTCCAAGGGTGCTGTGTACTGGACAGATGTGAQ 

GGAGGCCMCAAACAGACCO'ACCTGAACCAGACTGGAGCrGCTGCACAGAACATOGTCAT 

CTCGGGCCTCGTGTCACCTGATCGCCTGGCCT6TGACTGGGTTOGCAAGAAGCTGTACTG 

GACGGACTCCGAGACCAACCGCATTGAGGTTGCCAACCTCAATGGGACGTCCCGTAAGGT 

TCTCTTCTGGCA6GACCa!GGACCAGCCAAGGGCCATTGCCCTGGATCCTGCACATGGGTA 

C7lTGTACTGGACTGACa?GGGGGGAAGCACCCCGGATCGAGCGGGCAGGGATGGATCGCAG 

TACCCGGTAGATCAOTGTAGACTCCGACAOTTACTGGCCCi^TGGGOTG^ 

6GAGGAACAGAAGCTGTACTGGGCCGATGCCAAGCTCAGCTTCATCCACCGTGCCAACCT 

GGACGGCTCCMCCGGCAGAAGGTGGTGGAGGGCAGCCTCACTCACCCTTTTGCCC^^ 

ACTCTCTGGGGACACACTCTACTGGACAGACTGGCAGACCCGCTCCATCCACGCCTGCAA 

CAAGTGGACAGGGGAGCAGAGGAAGGAGATCCTTAGTGCTCTCXACTCACCCATGGAC^ 

CCAAGTGCTGAGCCAGGAGCGGCAGCCTCCCTTCCACACACCaLTGCGAGGAGGACAACGG 

TGGCTGTTCCCaCCTGTGCCTGCTGTCCCCGAGGGAGCCTOTCTACTCCTGTGCCTGCCC 

CACTGGTGTGCAGTTGCAGGACAATGGCAAGACGTGCAAGACAGGGGCTGAGGAAGrGCT 

GCTGCTGGCTCGGAGGACAGACCTGAGGAGGATCTCTCTGGACACCCCTGACTTCACAGA 

CATAGTGCTGCAGGTGGGCGACATCCGGCATGCCATTGCCATTGACTACGATCCCCTGGA 

GGGCTACGTGTACTGGACCGATGAOJGAGGTGCGGGCTATCCGCAGGGCGTACCTAGATGG 

CTGAGGTGCGCAG^CACTO'GTCAACACTGAGATCAATGACCCCGATGGCATTGCTGTGGA 

CTGGGTCGCCCGGAACCTCTACTGGACAGATACAGGCACTGACAGAATTGAGG'rGACTCG 

CCTCAACGGCACCTCCCGAAAGATCCTCGTATCTGAGGACCT6GACGAACCGCGAGCCAT 

TGTGTTGCACCCTGTGATGGGCCTCATCTACTGGACAGACTGGGGGGAGAACCCCAAAAT 

CGAATGCGCCAACCTAGATGGGAGAGATCGGCATGTCCTGGTGAACACCTCCCa'TGGGTG 

GCCCAATGGACTGGCCCTGGACCTGCAGGAGGdC?^GCTGTACTGGGGGGATGCC:yiA^^ 

TGATAAAATCGAGGTGATCAACATAGACGGGACAAAGCGGAAGACCCTGCTTGAGGAC^ 

GCTCCCACACATTTOTGGGTTCACACTCCTOGGGGACTTCATCTACTGGACCGACTC^ 

GAGACGCAGTATTGAAAGGGTCCACAAGGrC?^GGCCAGCCGGGATGTCATCAa?TGATCA 

ACO^CCGACCTCATCGGACTCAAAGCCGTGAATGTGGCCAAGGTTGTCGGAACCAACCC 

ATGTCCGGATGGAAATGGAGGGTGCAGCCATCTGraCTTCn'CACCCCACGTGCCACCaA 

GTGTGGCTGCCCCATTGGCCTGGAGCTGTTGAGTGACATGAAGACCTQCATAATCCCCGA 

GGCCTTCCTGGTATOXIACCAGCAGAGCCACCATCCACAGGATCTCCCTGGAGACTAACAA 

CAACGATCrGGCrATCCCACa?CACGGGTGTCAAAGAGGCCrCTGCACTGGACTTTGATGa? 

GTCCAACAATCAaiTCTACa?G6ACTGATGTrAGCCTCAAGACGATCAGCCGAGCCTTCAT 

GAATGGGAGCTCAGTGGAGCACGTGATTCA6TTTGGCCTCGACTACCCTGAAGGAATGGC 

TGTGGACTGGATGGGCAAGAACCTCTATTGGGCGGACACAGGGACaUiLCAGGA 

GGCCCGGCTGGArGGGCAGraCCGGCAGGTGCTa?GTGTGGAGAGACCTTGACAACCCCAG 

GTCTCTGGCTCTGGATCCTACTAAAGGCTACATCTACTGGACTGAGTGGGGTCGCAAGCC 

AAGGATTGTGCGGGCCTa?CyVTGGATGGGACCAATTGTATGACACTGGTAGACAAGGaG^ 

CCGGGCCAACGACC'TCACCATTGATTATGCCGACCAGCGACTGTACTGGACTGACC'EGGA 
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TCTGCCCTACCCGTTTGGCCTGACTCAATATAGCGATTACATCTACTGGACTGACTGGAA 

CCTGCATAGCATTGAACGGGCGGACAAGACCAGTGGGCGGAACCGCACCCTCATCCAGGG 

TCACCTGGACTTCGTCATGGACATCCTGGTGTTCCACTCCTCCCGTCAGGATGGCCTCAA 

CGACTGCGTGCACAGCAATGGCCAGTGTGGGCAGCTGTGCCTCGCCATCCCCGGAGGCCA 

CCGCTGTGGCTGTGCTTCACACTACACGCTGGACCCCAGCAGCCGCAACTGCAGCCCGCC 

CTCCACCrrCTTGCTGTOXIAGCCAGAAATTTGCCATCAGCCGGATGATCCCCGATGACCA 

GCaK:AGCCCGGACCTaK3TCCTACCCCTTCAaX3GGCTGAGGAACGTCAAAG^ 

TGACCCGCTGGACAAGTTCATCTACTGGGTGGACGGGC6CCAGAACATCAA6AGGGCCAA 

GGACGACGGTACCCAGCCCTCCATGCTGACCTCTCCCAGCCAAAGCCTGAGCCCAGACAG 

ACAGCCACACGACCTCAGCATTGAC7VTCTAC71GCCGGACACTGTTCTGGACOTGTGAGGC 

CACCAACACTATCAATGTCCACCGGCTGGATGGGGATGCCATGGGAGTGGTGCTTCGAG6 

GGACCGTGACAAGCCAAGGGCCAa?TGCaX5TCAATGCTGAGCGAGGGTACATGTACTTTAC 

CaACArGCAGGACCATGCa?GCCAAGATa3AGCGAGCCTCCCTGGATGGC^ 

GGTCCa?CrTCACCACAGGCCTCATCCGTCCCGTGGCCCTTGTGGTGGACAATCCTCTC^ 

CAAGCTCTTCOXSGGmSATGCCGACCTAAAGCGAATCGAAAGCTGTGACCTCTCTG [ GGG 

CCAACCGCCTGACCCTGGAAGATGCCAAGATCGTACAGCCAGTAGGOXITCACAGTG^^ 

GCAGGCACCTCTACTGGATCGACCGCCAGCAGCAGATGATCGAGCGCGa?GGAGAAGACC] 

ACTGGGGACAAGCGGACTAGGGTOCAGGGCCGTGTCACCCACC [ TGACAGGCATCC ATGC 

CGTGGAGGAAGTCAGCCTGGAGGAGTTCT]CAGCCCATCCTTGa^CCGAGACAA 

GCrGCTCCCACATCTGTArCGCCAAGGGa^a?GGAACACCGCGCTGCTCGTGCCCT^ 

ACCTOGa?GCTCCTGCAGAACCTGCTGACTTGTGGTGAGCCTCCTACCTGCTCCCCTGATC 

AGTTTGCATGTACCACTGGOIGAGATCGACTGCATCCCCGGAGCCTGGCGCTGTGACGGCT 

TCCCTGAGTGTGCTGACCAGAGTGATGAAGAAGGCTGCCCAGTGTGCTCCGCCTCTCAGT 

TCCCCTGCGCa?CGAGGCCAGTGTGTGGACCTGCGGTTACGCTGCGACGGTGAGGCCGACT 

GCCAGGATCGCTCTGATGAAGCTAACOXSCGATGOTGTCTGTCTGCCCAATCAGTTCCGGT 

GCACCAGCGGCCUlGTGTGrCCTCATCAAGCAACAGTGTGACTCCTTCCCCGACTGTGCTG 

ATGGGTCTGATGAGCTCATGTGTGAAATCAACAAGCCACCCTCTGATGACATCCC^^ 

ACAGCAGTGCCATTGGGCCCGTCAa?TGGTATCATCCTCTCCCTCTTCGTCATGGGCGGGG 

TCTACTTTGTCTCCCAGCGTGTGATGTGCCAGCGCTACACAGGGQCCAGTGGGCCCTTTC 

CCCACGAGTATGTTGGTGGAGCCCCTCATGTGCCTCTCAACTTCATAGCCCCAGGTGGCT 

CACAGCACGGTCCCTTCCCAGGC:ATCCCGTOCAGCAAGTCCGa?GATGAGCTCCATGAGCC 

TGGTGGGGGGGCGCGGCAGCGTGCCCCTCrATGACCGGAATCACGTCACTGGGGCCTCAT 

CCAGCAGCTCGTCCAGCaCAAAGGCCACACTATATCCGCCQATCCTGAACCCACCCCCGT 

CCCCGGCCACAGACCCCTCTCTCTACAACGTGGaCGTGTTTTATTCTTCAGGCATCCCM 

CCACCGCTAGACCATACAGGCCCTACGTCATTCGAGGTATGGCACCCCCAACAACACCGT 

GCAGCACAGATGTGTGTGACAGTGACTACAGCyiTCAGTCGCTGGAAGAGCAGCAAATACT 

ACCTGGACTrGAATTCGGACTCAGACCCCTACCCCCCCCCGCCCACCCCCCACAGCCAGT 

ACCTATCTGCAGAGGACAGCTGCCCACCCTCACCAGGCACTGAQAGGAGTTACTGCCACC 

TCTOCCCGCCCCCACCGa?CCCCCTGC^CGGACrcGTCCTGACCTCGGCCGTCCACCCG^ 

CCTGCTGCCTCCCTGTAAATATTTTTAAATATGAACaAAGGAAAAATATATTTTATGATT 

TAAAAAATAAATATAATOGGGGTTCTTAACAAGTGAGAAATGTGAGCGGTGAAGGGGTGG 

GCAGGGCTGGGAAACTTTTCTAG 



Gene Sequence 
Structure * 



3659 bp 



Sequence Deleted 



3701 bp 



Size of full-length 
cDNA: 5119 bp 
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Targeting Vector* (genomic sequence) 
Construct Number: 992 



Arm Length: 
y:1.5kb 
3*: 2.9 kb 



'5* arm 



Neo 
Cassette 



3' ami 



— - — — » fEndo^nousjliociis 
* Not drawn to scale 




5 ' >AAATATGCATTATCCTGAGCA 
CAGTGGGTCTGGCCCraCACTTGG 
CtGCCACTCaTGGAGCCTTTATGC 
TAACCACAGGGGCCAACCGCCiTGa 
CCC!rGG2UlGATGCCAACAa?CGTAC 
AGCCAGTAGGTCTGACAGTGCTGG 
GCAGGCRCCTCTACTGGATCGACC 
GCCAGCAGCAGATGATaSAGCGTG 
TGGAGAAGACCO ' 
(SEQ ID NO: 9) 



5 ' >TCACTGGCATCCAa?GCAGTG: 
AGGAAGTCAGCCTGGAGGAGTTCT 
GTACGTGAGAGGGGACAGTGTOTG 
TGGO'GGGGTCTCCTGGGGGAAGGT 
GAATCAGCCCTACTG6CATCAGAT 
GGGCTGCTGGTGCAAGAGCAGTGT 
GCCTGAGGAGCTCATGGGCa?CAGC i 
ACCGAAGGCCAGOXSCATGTCCAGA , 

TGTca!Gcca?ca?<3 ' 

(SEQ ID NO: 10) 
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GGGGGGGGGGGGGGTCAGAGGGAGCTTTCTTTTAGGCCCGTCCATCTGTGAAT 

CTAACCTCAGTTTCTGGGAATCAGGAAGCATGCATCTCCTCTTAGTTCAGCTG 

CTTGTTCTCTTGCCTCTGGGGAA6GCAGACCTATGTGTGGATGGCTGCCAGAG 

TCAGGGCTCTTTATCCTTTCCTCTCCTAGAAAGGGGTCGCAGAGATCTCCACG 

TGGCCAACCACGAGGAGGCA6AAGACAAGCCGGATCTGTTTGTGGCCGTGCCA 

CACCTCATGGGCACCAGCCTGGCTGGGGAAGGCCAGAGGCAGAGAGGGAAGAT 

GCTGTCCAGGCTTGGAAGATTCTGGAAGAAACCTGAGACCGAATTTTACCCCC 

CAAGGGATGTGGAAAGCGATCATGTCTCATCGG6GAT6CAGGCCGTGACTCAG 

CCAGCAGATGGGAGGAAAGTGGAGAGATCACCTCTACAGGAGGAAGCCAAGAG 

GTTCTGGCATCGGTTCATGTTCAGAAAGGGCCCGGCGTTCCAGGGAGTCATCC 

TGCCCATCAAAAGCCACGAAGTACACTGGGAGAC.CTGCAGGACTGTGCCCTTC 

AACCAGACCATTGCCCATGAAGACTGTCAAAAAGTCGTTGTCCAGAACAACCT 

TTGCTTTGGCAAATGCAGTTCCATTCGTTTTCCCGGAGAAGGGGCAGATGCCC 

ACAGCTTCTGCTCCCACTGCTCGCCCACCAAATTCACCACCGTGCACTTGATG 

CTGAACTGCACCAGCCCAACCCCCGTGGTCAAGATGGTGATGCAAGTAGAAGA 

GTGTCAGTGCATGGTGAAGACGGAACGTG6AGAGGAGCGCCTCCTACTGGCTG 

GTTCCCAGGGTTCCTTCATCCCTGGACTTCCAGCTTCAAAAACAAACCCATGA 

ATTACCTCAACAGAAAGCAAAACCTCAACAGAATAAGTGAGGGTTATTCAATC 

TGGAAATGTTATGTGAGTTATATAAAGATCAGTGGAAAATATCTTTCTCTCTC 

CCTCTCTCCCCCTCTCTCTTCTCTCTATTTTCTCTCTCTCTCTCTGTCTCTCT 

CTCTCTCTCTCTCTCTCTCACACACACACACACACACACACACACACACACAT 

GTTTGTGTTTAGACAGGGTCTTATGTATTCTCAGCTGGCCTCAAACTCACAAT 

GTGGCTGGGGATGATTTTAAACTCCTGATCCAATTCCTGAGTGCTGGGATTAC 

AGACATGCTCCATAAimCATAGCTCCCAGAAGGATrTTTAAAAGAGATTTTGC 

ATGTTTCAAAGTTGCCTTTGAGACTCAGAAATATTTTGATI3TATTGAATGGCC 

TTGCCACAGATGTGGGAGGCAGCTTGCTTGGTGGCCCAAGTATTTTTTTTTTG 

TTCGTTCAGAATTCTCCACATGAAGTTTTTACTGTTGGTTATCTGGCGTTGAA 

Giy^GGAATAGTGAAGGTACTTTTAACAGTTTACACGTGGAAGGGGCTCAGGCA 

CTAG6AACCAACCTTTTCCCGGAATATGAGGAAAATACATGAACAGTATTAGA 

GTCACTTGAGGAAGTTACTAGGAAACGCCATAAGTCTCCAAGTACATTGTGAG 

TCATTTTGAAGGACAATCGTGTATATAGACGAAATCTTCTACTCGTATGCTTT 

TGAATCTTCTAGCAAGTTAGGTTTCTAT6TTTGGGCTTCTTCCTATTGTCTAA 

GAGTATGTGTGACAAATTCAACCTGACAAATACCTCAATGGCAAATTCTGACC 

CTG (SEQ ID NO: 11) 

PfflLLLVQLLVLLPLGKADLCVDGCQSQGSLSFPLLERGKEyDLHV-ANHEE^ 

PDLFVAWHIJJIGTSLAGEGQRQRGKMLSRLGRFWKKPETEFyPPRDV^ 

SGMQAVTQPAIXSRKVERSPLQEEAKRFWHREMFRKGPAFQGVILPIKSHEVHW 

ETCRTVPFNQTIAHEDCQKVWQNNLCFGKCSSIRFPGEGADAHSFCSHCSPT 

EOFTTVmmNCTSPTPVViavEVMQVEECQCMVKTERGEER^ 

PASKTNP (SEQ ID NO: 12) 
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Construct 



Gene: 67 Gl Number(s): 6753409 

Gene Fami ly : Growfh Factor Inhibitor 

Gene ^ , 

« , £ Cerberus 
Subfamily: 

Gene Sequence: full-length cDNA, Mouse 



underlined - deleted in targeting construct 

[ ] = sequence flanking Neo insert in targeting construct 



[GGGGGGGGGGGGGGTCAGAGGGAGCTTTCTTTrAGGCCCGTCCATCTGTGAATCTAACC 
TCAGTrrCTGGGAATCAGGAAGCATGCATCTCCTCTOAGTTCAGCTGCTTGTTC 
CTCTGGGGAAGGCAGACCTATGTGTGGAT6GCTGCCAGAGa?CAGGGCrcTTTATC^^ 
CCTCCTAGAAAGGGGTCGCAGAGATCTCCACGTGGCCAACCACGAGGAGGCAGAAGACAA 

] GCCGGATCTGTraGrGGCCGTGCCACACCTCAI^GCACCAGCC!r^CTGG(^ 

AGAGGCAGAGAGGGAAGATGCTGTCCAGGCTTGGAAGATTCTGGAAGA7VACCTGAGACCG 

AATTTrACCCCCCAAGGGATGTGGAAAGCGATCATGTCTCATCGGGGATGCAGGCCGTGA 

CTCAGCCAGCAGATGGGAGGAAAQTGGAGAGATCACCTCTACAGGAGGAAGCCAAGAGGT 

TCTGGCATCGGTTCATGTTCAGAAAGGGCCCGGCGTTCCAGGGAGTCAT [CCTGCCCATC 

AAAAGCCACGAAGTACACTGGGAGACCTGCAGGACTGTGCCCTTCAACCAG]ACCa.^ 

CCAaXJAAGACTGTCAAAAAGTCGa^TCCAGAACAACCTraGCTa?a?GGC^ 

CATTCGTTMCCCGGAGAAGGGGCAGATGCCCACAGCTTCTGCTCCCACTGCTCGCCCAC 

CAAATTCACCACCGTGCACTTGATGCTGAACTGCACCAGCCCAACCCCCGTGGTC^ 

GGTCATGCAAGTAGAAGAGrercAGTGCAO^TOAAGACGGaACGraGAGAC^ 

CCTACTGGCTGGTTCCCAGGOTTCCTTCATCCCTGGACTTCCAGCTTCAAAAACAAACCC 

ATGAATTACCTCAACAGAAAGCAAAACCTCAACAGAATAAGTGaGGGraAraC?lATC 

AAATGTrATGTGAGraATATAAAGATCAG(r^AAAATATCTa'ax:^ 

CCTCTCTCTTCTCTCTATTTTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCT 

TCTkCACACACACACACAGACACACACACACACACATGTOTGTGOJTTAGACA^ 

GTATTCTCAGCTGGCCTCAAACTCACAATGTGGCTGGGGATGATTTTAAACTCCT 

AATTCCTGAGTGCTGGGATTACAGACATGCTCCATAANACATAGCTCCCAGAAGGAOT^ 

TAAAAGAGATTTTOCATGTTTCAAAGTaxSCCTTTGAGACTCAGAAATATTTTGAOTTATr 

GAATGGCCTTGCCyiCTlGATGTGGGAGGCAGCTTGCTTGGTGGCCCAAGTATraTra 

GTTCGTTCAGAAMCTCCACATGAAGTTTTTACTGTTGGTTATCTGGCGTTGAAGAAGGA 

ATAGTGAAGGTACTOTTAACAGTTTACACGTGGAAGGGGCTCAGGCACTAGGAACCAACC 

TTTTCCCGGAATATGAG6ARAATACATGAACAGTATTAGAGTCACTTGAGGAAGTTACTA 

GGAAACGCCATAAGTCTCCAAGTACATTCTGAGTCATTtPTGAAGGACy^TCGTC^^ 

GACGAAATCTTC^rACTCGTATGCTTTTGAATCrrcTAGCAAGTTAGGTOTCT^^ 

6CTTCTTCCTATTGTCTAA6AGTATGTGTGACAAATTCAACCTGACAAATACCTCAAT6G 

CAAATTCTGACCCTG 



Gene Sequence 
Structure * 



241 bp 



Sequence Deleted 



528 bp 



Size of fulWength 
cDNA: 1752 bp 
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Targeting Vector* (genomic sequence) 
Construct Number: 269 



Arm Length: 

5':2kb 

3':1.7kb 



Neo 
iCassBtte 



3' ami 




* Not drawn to scale 



5«>TCCATC!rGTGAATCTAACCTC , 

AGTCTCTGGGAATCAC3GAAGCATG [ 

CATCTCCTCTOAGTTCAGCTGCTT 

GTOCTCTOGCCTCTGGGGAAGGCA 

G&CCTATGTGTGGA.TGGCT'GCCAG 

AGTCAGGGCTCTTTATCCTTTCCT 

CTCCCAQAAAGGGGTCGCAGAGAT 

CTCCACGOKSGCCAACCACGAGGAG 

GCA(3AAGACAA<3 ' 

(SEQ ID 3SrO:13) 



5 ' >CCTGCCCATCAAAAGCCACGA 

AGTACACTGGGAGACCTGCAGGAC 

TGTGCCCTTCAACCAGGTATGCAT 

ax:TAQAGGGTAAACCACCAGTa?TG 

CCAGACAGGGAGGACAGCT6QACA 

GCTAGGACAAACGGCAAAATAGAA 

AGAGTCTGGCGAGAGCTCGGGCCT 

TGTCTAGTTCCAGATTCAGTCCTT 

TGGGTATTCATO ■ 

(SEQ XD NO: 14) 
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TCGGGGCTAGGGCTTGTGGGTAGCT (SEQ ID NO: 15) 
ACAATTCTGAAAACTGGTCCCTGGT (SEQ ID NO: 16) 



GTTCCTGGATGGCTGGGACCCANCCAGGACCTCCTGGTGGGAGATGTCATCCG 
CCAAGCCCTGCCCAACAGGAACACTAAGGTCAAATACTTCATCCCACCCTCAA 
TGTACAGGGCCACCCACTACCCACCCTATGCTGGTGGGGGAGGATATGTCATG 
TCCAGAGCCACAGTGCGGCGCCTCCAGGCTATCATGGAAGATGCTGAACTCTT 
CCCCATTGATGATGTCTTTGTGGGTATGTGCCTGAGGAGGCTGGGGCTGAGCC 
TATTGCACCATGCTGGCTTCAAGACATTTGGAATCCGGCGGCCCCTGGACCCC 
TTAGACCCCTGCCTGTATAGGGGGCTCCTGCTGGTTCACCGCCTCAGCCCCCT 
CGAGATGTGGACCATGTGGGCACTGGTGACAGATGAGGGGCTCAAGTGTGCAG 
CTGGCCCCATACCCCAGCGCTGAAGG6TGGGTTGGGCAACAGCTGAGAGTGGA 
CTCAGTGTTGATTCTCTATCGTGATGCGAAATTGATGCCTGCTGCTCTACAGA 
AAATGCCAACTGGGTTTTAACTCCTCTCACCNGTTAGWTCGATTANAACACTG 
CAACCCAAAAAAAAZy^AAAAANCGGGGGGTTTTTTTGTCTTTTTTTTTTTT^ 
TAGAGNGCGCAACGT (SEQ ID NO: 17) 




